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FOREWORD 


This  report  was  prepared  by  the  Auerbach  Corporation,  Philadelphia,  Pennsylvania, 
under  Contract  Number  AF30(  602) -3564;  Project  Number  4594  and  Task  Number 
459402.  The  RADC  project  engineer  is  Ronald  J.  Ferris,  EMIID. 

This  is  a  final  report  and  covers  the  period  of  work  from  October  1964  to  September 
1965;  the  originator’s  report  number  is  1231-TR2. 

The  results  of  this  study  are  contained  in  two  volumes: 

Volume  I  presents  in  summary  form,  an  introduction,  findings,  conclusions,  and 
recommendations.  In  addition,  this  volume  contains  a  discussion  on  associative 
memories  and  the  comparative  analyses  of  the  associative  memory  configurations 
studied.  A  short  critique  of  the  Goodyear  Associative  Processor  and  a  bibliography 
are  also  given  in  this  volume. 

Volume  II  contains  the  detailed  discussion  of  the  study.  This  volume  specifies  the 
problem  of  "how"  the  study  was  conducted .  It  presents  the  technical  details ,  the 
relevant  flowcharts,  algorithms,  and  encoding.  Each  section  of  this  volume  contains 
information  pertinent  to  the  approach,  technical  details,  and  the  findings  and  con¬ 
clusion.  Supplementary  information  relevant  to  the  study  is  contained  in  the  appen¬ 
dices  . 


The  principal  investigators  on  this  study  were  Mr.  James  Dugan,  Mr.  Robert 
Shaw  Green  (Project  Engineer  -  hardware  related  studies) ,  Mrs.  Renee  Jasper, 

Mr.  Marvin  Katz,  Dr.  Jack  Minker  (Program  Manager) ,  Mr.  James  Mulford,  and 
Mr.  Warren  E.  Shindle  (Project  Engineer  -  problem  analysis  and  programming  related 
studies) .  The  authors  would  like  to  express  their  appreciation  to  the  numerous  mem¬ 
bers  of  the  AUERBACH  Technical  Staff  who  "  ere  called  upon  for  expert  advice  on 
specialized  areas  of  study.  These  are:  Mr.  George  Neborak,  Mr.  Sheldon  Einhorn, 
Mr.  Don  Russell,  Mr.  Ken  Rose,  Mr.  Lawrence  Feidelman,  Mr.  Ralph  Howe,  Mr. 
Arthur  Hughes,  Mr.  Stephen  Leibholz,  Mr.  Lester  Probst,  Mr.  Robert  Rossheim,  and 
Dr.  Jerome  Sable. 


Release  of  subject  report  to  the  general  public  is  prohibited  by  the  Strategic  Trade  Control 
Program, Mutgol  Defense  Assistance  Control  Listfrevised  6  Jon.  65)  published  by  the 
Department  of  State. 
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ABSTRACT 


The  objective  of  this  effort  wus  to  determine  the  effect  of  associative  memories 
which  are  realizable  today  to  aid  in  processing  formatted  record  problems.  The 
evaluation  consisted  of  a  comparison  between  the  CDC  1604B  and  the  CDC  I604B- 
Associative  Memory  to  process  the  same  problem.  The  Goodyear  Aerospace  Cor¬ 
poration  associative  memory  was  used  to  establish  state-of-the-art  in  associative 
memories ,  however ,  other  associative  memory  designs  were  investigated . 
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EVALUATION 


I.  Study  Objective: 

The  objective  of  this  work  was  to  determine  the  value  of  state-of-the-art  associative 
memories  to  more  efficient  execution  of  typical  non-numeric  and  intelligence  problems 
by  the  general  purpose  computer .  Thus  it  was  not  the  intent  to  provide  an  absolute 
figure  of  merit  which  would  set  the  value  of  small  associative  memories.  Nor  was 
it  the  intent  to  evaluate  the  full  range  of  memory  logic  design  or  nature  of  connection 
to  the  computer  which  an  associative  memory  device  may  take.  It  was  hoped  however, 
that  a  useful  reference  point  would  be  established  with  which  to  guide  further  studies. 

It  should  be  noted  that  these  studies  are  the  first  significant  attempts  to  evaluate 
an  associative  memory  in  a  system  context;  *  i.e. ,  the  associative  memory  is  con¬ 
sidered  in  association  with  a  general  purpose  computer,  typical  peripheral  hardware, 
and  system  programming. 

II.  Approach: 

A.  Selection  of  CDC  1604B  and  CDC  818  Disc 

At  the  outset  it  was  realized  that  the  CDC  1604J3  computer,  the  CDC  818  disc  and  the 
Goodyear  associative  processor  might  not  result  in  an  optimum  combination  for  the 
problems  selected.  Rather,  the  selection  was  based  on  the  following: 

1.  The  CDC  1604B  is  representative  of  "second  generation"  computers 
which  are  the  most  prominent  in  Air  Force  use  today.  Moreover,  due  to  the 
high  cost  of  replacing  computer  systems ,  it  may  be  some  time  before 
present  systems  are  replaced.  Consequently,  considerable  savings  might 
be  realized  from  determining  how  these  computers  could  be  used  more  ef¬ 
ficiently. 

2.  RADC  recognized  that  there  are  several  factors  in  addition  to  the  choice 
of  an  associative  memory  and  a  general  purpose  computer  which  affect 
study  results.  These  factors  are  associated  with  the  software  and  with  file 
structure,  query  statements,  etc.  thus  by  selecting  the  CDC  1604B  and  the 
CDC  818  disc,  we  provide  the  capability  to  modify  the  software  and  gain  a 
more  thoroutf*  understanding  of  the  role  of  software  in  evaluating  associa¬ 
tive  memories. 


♦See  Mr.  Ronald  Ferris'  comments  in  Volume  II  of  the  International  Federation  for 
Information  Processing  1965  proceedings  under  the  panel  session  on  Content  Address¬ 
able  Memories. 


3.  Since  the  hardware  is  available  at  RA^r ,  the  software  programs  de¬ 
veloped  can  be  implemented  and  exercised  in  order  to  provide  empirical  data 
with  a  minimum  of  cost  and  time. 

It  should  be  emphasized  that  the  selection  of  the  CDC  1604B  computer  and  CDC  818  disc 
does  not  affect  the  accuracy  of  the  comparison  made  between  the  associative  memory 
configuration  and  the  non -associative  memory  configuration,  since  in  both  cases  the 
same  general  purpose  computer  and  disc  were  used.  In  fact,  this  is  the  type  of  data 
that  is  of  interest  to  us . 

B.  Selection  of  Goodyear  Associative  Processor  (GAP) 

The  primary  intent  of  this  effort  was  to  evaluate  the  GAP  merely  as  a  first  step  in 
a  long  range  plan  to  evaluate  associative  memories.  Consequently,  whether  or  not 
the  GAP  is  optimally  designed  for  a  specific  task  is  relatively  unimportant.  Instead 
it  should  be  viewed  as  a  research  tool  for  use  in  obtaining  valuable  information  per¬ 
taining  to  associative  processing,  particularly  in  the  area  of  software.  Accordingly 
the  GAP  design  was  selected  with  the  intent  to  evaluate  a  state-of-the-art  product  with 
reasonable  cost.  Finally,  it  should  be  emphasized  that  evaluation  of  GAP  cannot  be 
construed  as  an  evaluation  of  Goodyear  Aerospace's  capability  to  design  computers. 

C.  Selection  of  Problems 

The  problems  used  throughout  the  study  are  actually  processing  functions.  The  intent 
was  to  choose  a  set  of  processing  functions  common  to  many  specific  problems  and  in 
so  doing  obtain  generalizable  results.  Hence,  the  term  sea  surveillance  merely  in¬ 
dicates  the  source  of  the  processing  functions  employed. 

Hie  selection  of  problems  is  a  very  important  step  and  should  not  be  ta \en  lightly. 

The  approach  utilized  in  this  study  was  briefly  the  following:  * 

1.  Identify  a  class  of  problems  within  the  Air  Force 

2.  Identify  the  processing  functions  within  that  class  and  select  the  most 
common. 

3.  Based  on  a  set  of  criteria;  i.e.,  priority,  availability  of  data  base.  etc. 
a  final  set  of  processing  functions  were  selected.  The  criteria  used  in  this 
analysis  is  as  follows: 


•This  work  was  perfotmed  in-the-house  at  RADC.  The  details  of  this  work  are  con¬ 
tained  in  a  report  located  at  RADC. 


x 


a.  Does  the  product  satisfy  an  operational  requirement? 

( 1)  Has  a  machine  program  been  written  to  produce  it? 

(2)  How  many  producers  use  it?' 

( 3)  Has  the  data  base  been  built  ?' 

b.  Are  the  program  steps  (operations'^  algorithm,  etc.)  common  to 
other  programs  ? 

c.  Is  the  program  and  data  base  description  accessable? 

(1)  Is  it  releasable  to  contractor  ( Security)  ? 

(2)  How  soon  can  it  be  delivered  to  contractor  (where  do  we  get 

it  and  must  it  be  sanitized)  ?  \ 

d.  Has  program  been  written  for  1604B  class  machue,  i.e. ,  is  it  one 

of  these:  7090,  1410,  1218?  ^ 


D.  Evaluation  Criteria  and  Measures 


\ 


The  following  criteria  and  measures  were  developed  by  RADC,  Auerbach  Corporation 


1 

and  Goodyear  Aerospace  Corporation  for  use  in  their  respective  investigations: 

;  j 

Criteria 

Measures 

t 

A.  Complexity  of  Programming 

1. 

No.  of  Instructions  in  program. 

Required 

2. 

Total  time  and  cost  of  program . 

3. 

Time  and  cost  oj  producing  general 

flow  charts  (problem  analysis  & 

definition) . 

4. 

Time  and  cost  of  detailed  flow  charts 

(required  for  time  analysis) . 

B.  Accuracy  of  Programming 

1. 

No.  of  errors  listed  in  (A3)  and 

(A4)  above. 

2. 

Type  of  errors  located  in  ( A3)  and 

« 

\ 

(A4)  above. 

* 

C.  Complexity  of  Hardware  Required 

1. 

Type  and  number  of  components  used 

(AM' 

in  AM  e.g.;  100  transistors. 

2. 

Cost  of  AM  and  necessary  interface . 

k"  ■  “ 

3. 

Time  and  cost  for  each  AM  macro 

instruction. 

'i 

4. 

Time  and  cost  for  each  basic  AM 

'ik  ■ 

instruction. 

5. 

No.  of  AM  searches  required  to 

{Lm:- 

complete  each  AM  macro  &  basic 

ainv'  ’ 

Instruction. 

xi 


Criteria  (Cont.)  Measures  ( Cont.) 

D.  Reliability  of  Hardware  1.  Reliability  of  each  component  multi¬ 

plied  by  some  factors,  e.g. ,  no.  of 
components . 


E.  System  Efficiency  1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

9. 

10. 


Time  to  execute  program  ( machine 
run  time) . 

Total  cost  of  programming,  hardware 
and  machine  running  time. 

No.  of  AM  &  1604B  instructions  used 
(and  not  used)  and  frequency  of  each. 
Amount  of  data  that  is  read  into  and 
out  of  AM  and  frequency  of  transfers . 
Time  AM  is  idle. 

Time  required  for  1604B  to  set  up 
AM  processing  (data  transfers  etc.) . 
Ratio  of  storage  requirements  vs 
availabilities. 

Time  1604B  is  idle. 

Time  disc  is  idle.  N/A 
%  of  relative  running  &  idle  times  for 
the  1604B,  Disc,  &  AM. 


& 

N/A  -  Not  Applicable  to  Goodyear 
m.  Use  of  Study  Results: 

The  results  of  this  study  provide  a  good  reference  point  when  determining  the  usefulness 
of  proposed  associative  memories.  It  is  believed  that  this  type  of  investigation,  i.e. . 
within  a  system  context  is  a  most  important  and  necessary  step.  It  is  our  hope  that 
workers  in  the  associative  memory  field  can  use  this  and  future  studies  as  a  yardstick 
in  designing  associative  memory  -  general  purpose  computer  configurations .  R ADC 
encourages  a  close  examination  of  criteria  ami  measures  to  be  used  in  evaluation  of 
associative  memories.  We  also  encourage  those  interested  to  examine  this  effort 
critically  and  would  make  welcome  suggestions. 

RONALD  /.  FERRIS 
Project  Engineer 
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SECTION  I.  MANAGEMENT  SUMMARY 


1.1  INTRODUCTION 

During  the  past  few  years  a  great  deal  of  speculation  has  arisen  concerning 
the  effectiveness  cf  an  associative  memory  (AM),  both  large  and  small,  within  a 
computing  system.  The  concept  of  an  associative  memory  in  which  data  is  accessed 
by  content  rather  than  by  its  physical  address  has  had  considerable  intuitive  appeal 
for  those  interested  in  data  manipulation  rather  than  computational  problems. 

The  technology  associated  with  implementing  such  devices  has  made  some 
progress.  Several  small  associative  memories  have  been  implemented  which  contain 
approximately  2, 048  words.  Such  memories  have  been  constructed  from  so-called 
BILOC,  BICORE,  or  similar  elements.  This  technology  is  rather  limited  with  re¬ 
spect  to  the  core  sizes  that  can  be  achieved  at  a  reasonable  cost.  The  technology 
associated  with  large  memories  is,  on  the  other  hand,  being  developed  at  the  research 
level,  using  super-conductive  elements. 

Although  considerable  attention  has  been  paid  to  the  hardware  aspects  f<  • 
both  large  and  small  associative  memories,  several  other  aspects  of  importance 
have  not  received  adequate  attention.  Thus,  if  one  reviews  the  extensive  literature 
noted  in  the  bibliography  of  this  volume,  very  little  may  be  found  which  describes 
the  manner  in  which  an  AM  can  be  integrated  within  a  computer  system,  the  program¬ 
ming  tools  that  are  required  to  take  effective  advantage  of  the  memory,  or  the  ad- 
vantvi  or  disadvantage  of  such  memories  in  specific  applications.  Considering  the 
early  state  of  this  growing  technology,  this  is  not  surprising. 

Rome  Air  Development  Center  (RADC)  has  developed  a  comprehensive  plan 
to  determine  the  advantages  and  disadvantages  of  both  large  and  small  associative 
memories.  The  work  reported  by  the  AUERBACH  Corporation  in  these  two  volumes 
is  part  of  this  comprehensive  plan.  Specifically,  the  study  has  focused  attention 
upon  small  associative  memories  as  typified  by  the  Coodyear  Associative  Memory 
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(a  description  of  the  Goodyear  Associative  Processor  (GAP)  is  contained  in 
Appendix  A  of  Volume  II)  within  the  context  of  a  computing  system  consisting  of 
a  conventional  computer  (CDC  1604-B),  discs  (CDC  818),  tapes,  and  conventional 
input-output  equipment.  Such  a  system  has  been  termed  by  RADC  a  hybrid  associ¬ 
ative  computer  system. 

As  a  vehicle  for  study,  RADC  has  requested  that  the  AUERBACH  Corpora¬ 
tion  investigate  a  total  data  storage  and  retrieval  system  to  determine  the  effective¬ 
ness  of  a  so-called  hybrid  associative  computer  system  for  such  a  problem,  and  to 
answer  some  of  the  following  questions: 

(1)  What  is  the  most  effective  hybrid  system  for  the  problem 
studied? 

(2)  How  can  an  associative  memory  be  best  integrated  into  the 
computer  system? 

(3)  What  are  the  costs  associated  with  developing  such  devices? 

(4)  What  possible  variants  of  technology  might  enhance  the 
AM  or  decrease  the  cost  with  small  effect  on  the 
processing? 

(5)  How  effective  are  the  variant  systems? 

(6)  Will  any  of  the  AMs  substantially  improve  the  overall  system 
processing  to  warrant  its  use? 

(7)  Is  a  small  associative  memory  effective? 

(8)  What  is  the  influence  of  an  AM  on  programming? 

(9)  What  programming  tools  are  required  to  facilitate  use  of  an  AM? 

This  management  report  providea  the  answera  and  limitations  upon  the  answers  to  the 
above  questions.  References  to  Volume  n,  which  contains  detailed  technical  support, 
are  provided  so  that  the  reader  may  delve  further  into  technical  material,  if  he  so 

desires. 


This  study  is  only  one  part  of  a  larger  and  more  comprehensive  effort 
being  directed  by  RADC,  and  while  this  investigation  has  a  clearly  defined  and  self- 
contained  goal,  the  larger  context  must  be  considered.  Insights  and  specific  results 
and  conclusions  of  this  study  are  reported  to  RADC  whether  they  are  of  primary 
importance  to  the  fulfillment  of  the  contract  or  a  secondary  result  of  the  study.  One 
of  the  goals  of  the  study  is  to  aid  RADC  in  the  evaluation  of  the  applicability  and 
usefulness  of  a  class  of  small  associative  memories  available  in  the  present  state 
of  the  technology. 

Within  the  framework  of  the  larger  RADC  effort,  the  overall  goal  of 
this  investigation  maybe  stated:  Evaluate  the  applicability,  effectiveness,  and 
efficiency  of  hybrid  associative  processors.  A  sufficient  range  of  hybrid  organiza¬ 
tions  and  depth  of  analysis  is  to  be  included  to  ensure  a  concrete  and  positive 
evaluation.  The  investigation  is  to  consider  both  programming  and  equipment 
factors. 

In  addition  to  the  overall  goals  specified  above,  several  criteria  for 
aiding  in  the  evaluation  of  the  AMs  studied  were  specified  by  RADC;  quantitative 
measures  which  should  be  obtained  to  aid  in  the  evaluations  were  also  specified. 

Such  measures  as  number  of  instructions,  number  of  transistors  and  flip  flops,  and 
time  to  perform  algorithms  for  both  hybrid"  associative  configurations  and  conven¬ 
tional  systems  were  utilized  as  quantitative  measures  and  were  developed.  Some 
measures,  such  as  cost  for  each  basic  AM  instruction,  cannot  be  answered.  Once 
a  basic  AM  capability  exists,  the  cost  of  a  particular  AM  instruction  may  be  insig¬ 
nificant  How  one  apportions  the  cost  of  an  instruction  to  the  overall  basic  capa¬ 
bility  could  not  be  determined  adequately.  The  study  was  conducted  using  as  a  basis 
a  problem  termed  sea  surveillance  and  defined  in  the  Office  of  Naval  Research  (ONR) 
document  entitled,  "Sea  Surveillance  Data  Base  Representation  as  Test  Vehicle.  " 

The  reasons  for  selecting  this  problem  are  given  in  the  subsequent  paragraphs. 

Within  this  context,  decisions  have  been  made  affecting  the  specific  scope,  methodology, 
and  approach  of  this  and  other  AM  research  projects.  Such  decisions  are  noted  in 
subsequent  paragraphs  of  this  section. 


1. 1.  2  Problem  Selection 


The  specific  problem  utilized  as  a  study  vehicle  was  selected  by  RADC. 

It  was  recognized  that  the  study  results  would  depend  upon  the  potential  that  a 
hybrid  associative  computer  system  might  have  to  aid  the  problem  selected.  The 
problem  area  selected  for  study  was  the  storage  of  data  and  the  retrieval  of  formatted 
files  relevant  to  the  intelligence  community. 

Before  selecting  a  specific  problem  for  study,  RADC  developed  several 
criteria  which  the  selected  problem  should  satisfy.'  The  problem  should  be  an 
actual  one  within  the  intelligence  community;  a  known  conventional  solution  should 
exist;  the  system  should  be  operational,  should  have  known  statistics  concerning 
such  factors  as  the  number  of  files,  size  of  data  base,  and  queries  posed  by  the 
user;  and  the  data  base  should  not  be  so  small  as  to  be  trivially  accommodated  by 
current  equipment  and  techniques.  Several  potential  data  bases  were  reviewed  (see 
Paragraph  1.3  of  Volume  II),  and  a  problem  termed  sea  surveillance  was  selected 
for  study.  This  problem  satisfied  most  of  the  selection  criteria,  and,  in  addition,  an 
unclassified  version  of  this  problem  was  available  and  defined  in  the  aforementioned 
ONR  document.  Several  solutions  to  the  sea  surveillance  problem  are  known  to  exist. 
(Note:  The  sea  surveillance  problem  reported  in  this  document  is  related  only  to  the 
above  referenced  ONR  document  and  may  bear  no  resemblance  to  any  actual  sea 
surveillance  problem  which  may  exist. ) 

1. 1. 3  Study  Approach* 

Several  approaches  can  be  taken  in  investigating  the  effectiveness  of  small 
associative  memories.  Each  approach  has  certain  advantages  and  disadvantages. 

One  approach  widely  used  to  evaluate  alternative  conventional  equipment  is  to  select 
several  complicated  problems  and  to  code  these  problems  in  detail  for  the  alternative 
configurations.  Advantages  to  such  an  approach  are: 

(1)  A  large  number  of  manageable  problems  can  be  defined,  detailed, 
and  analyzed. 

(2)  Alternative  situations  can  be  considered,  e.g. ,  alternative  data 
structures,  record  and  file  organizations. 

*Also  also  evaluation 
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(3)  Alternative  problems  bearing  upon  several  subject  areas  can  be 

considered,  e.  g. ,  compiler  developments,  network  flow  problems, 
and  data  retrieval. 

Disadvantages  a  endant  with  such  an  approacn  are: 

(1)  The  importance  that  should  be  attached  to  each  individual  prob¬ 
lem  is  difficult  u>  define  since  its  frequency  of  use  in  a  particular 
system  may  be  difficult  to  assess. 

(2)  Problems  of  importance  to  specific  areas  of  interest  on  a  parti¬ 
cular  problem  may  not  be  covered. 

A  second  approach  is  to  select  a  single  comprehensive  problem,  potentially 
rich  in  associative  memory  operations,  for  study.  This  approach  has  several  advan¬ 
tages: 

(1)  A  comprehensive  system  is  viewed  for  all  data  processing  func- 
tions  so  that  the  frequency  of  operations  is  known  and  no  processes 
are  overlooked. 

(2)  Areas  which  may  not  have  been  selected  for  associative  memory 
operations  may  be  uncovered  by  reviewing  all  other 
processes. 

/ 

(3)  Assumptions  as  to  the  location  of  data  which  may  tend  to  bias  a 
solution  towards  an  associative  memory  are  avoided  since  one  must 
take  cognizance  of  the  precise  location  of  data  as  developed  in 
previous  processing  steps. 

(4)  The  associative  memory  can  be  utilized  to  store  more  than  one 
class  of  data  as  required  by  dynamic  system  processing. 

As  with  the  aforementioned  approach,  there  are  certain  inherent  disadvantages.  Some 
of  these  are: 

(1)  The  solution  reached  for  a  particular  problem  may  be  too  specific 
and  may  preclude  generalization  to  other  problems. 

(2)  The  specific  problem  may  be  such  that  the  full  extent  of  the  AM 
is  not  utilized. 
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(3)  Alternative  solutions  might  not  be  evaluated  sufficiently  since 
adequate  time  for  investigating  each  alternative  might  not  be 
available. 

(4)  Excessive  time  might  be  spent  in  systems  design  of  the  problem 
and,  as  a  result,  take  time  away  from  study  of  the  details  of 
associative  memory  processing. 

Realizing  the  advantages  and  disadvantages  attendant  with  each  approach, 

RADC  selected  the  latter  for  the  AUERBACH  Corporation  and  the  former  for  other 
contractors.  The  study  realized  all  of  the  advantages  in  this  approach.  By  being 
forced  to  think  at  both  a  systems  and  a  detailed  level,  many  observations  concerning 
associative  memory  processing  have  been  developed,  At  the  same  time,  several  of 
the  above  disadvantages  are  pertinent  The  solution  reached  is  believed  to  be  suf¬ 
ficiently  general  and  applicable  to  problems  of  the  same  class.  However,  the  queries 
and  maintenance  operations  specified  as  'typical'  in  the  problem  selected  by  RADC  for 
study  (the  sea  surveillance  problem),  did  not  sufficiently  exercise  the  AMs  studied. 

That  is,  the  queries  were  very  simple  and  did  not  require  much  processing  while  the 
data  resided  in  the  associative  memory.  More  complex  questions  could  have  shown 
the  AMs  to  greater  advantage.  Alternative  solutions  (e.  g. ,  different  file  structures) 
could  not  be  studied.  Thus,  perhaps  a  more  clever  file  structure  than  that  utilized  in 
this  study  could  have  been  devised  which  would  have  shown  the  AMs  to  greater  advantage. 
The  study  team  cannot  assess  that  since  sufficient  time  did  not  exist  for  exploring  this 
possibility.  They  can  verify  that  some  time  was  spent  in  systems  design  which  could 
have  been  spent  more  profitably  on  the  details  of  AM  programming.  In  spite  of  this, 
much  was  learned  in  terms  of  organizing  a  system  around  an  AM  and  programming 
for  associative  memories.  The  study  team  believes  that,  in  the  main,  the  advantages 
of  the  approach  outweighed  the  disadvantages. 

1. 1. 3. 1  Hardware  Approach.  RADC  has  purchased  an  experimental  associative  mem¬ 
ory  from  the  Goodyear  Aerospace  Corporation.  This  AM  has  been  termed  the  Goodyear 
Associative  Processor  (GAP)  and  is  to  be  attached  to  the  1604-B  at  RADC.  AUERBACH 
was  asked  to  utilize  the  GAP  in  their  study,  and,  more  broadly,  to  investigate  variations 
of  the  memory  based  upon  the  Goodyear  technology  and  within  the  state  of  the  art. 
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There  are  several  AM  variants  that  warrant  study.  These  are: 

(1)  Logical  Organization  of  the  AM.  The  variations  range 

from  AMs  which  have  minimal  control  units  requiring  detailed 
control  by  instruction  from  the  1604-B  to  ones  which  are  inde¬ 
pendent  of  the  1604-B. 

(2)  Interface  with  the  1604-B.  The  variations  possible  here  range 
from  one  which  requires  no  modification  to  the  1604-B  to  one 
which  has  a  special  channel  and  special  register  and  finally  to  one 
fully  integrated  with  the  1604-B. 

(3)  Instruction  Set.  The  possible  variations  range  from  a  minimal 
set,  which  would  decrease  costs  of  the  AM,  to  one  which  has  ex¬ 
tensive  search  capabilities  and  which  generalizes  the  GAP 
capabilities. 

The  specific  memories  studied  were  developed  after  extensive  meetings  between 
programmers  and  the  hardware  designers.  Cost  analysis  played  a  major  role  in  the 
selection  of  the  specific  variants  that  were  to  be  exercised  in  the  study.  To  determine 
the  costs  involved,  the  GAP  technology  was  studied  thoroughly,  logic  diagrams  were 
developed,  and  estimates  made  of  the  amount  of  equipment  that  would  be  required. 
AUERBACH'S  knowledge  of  the  state  of  the  art  of  integrated  circuit  technology  and  BILOC 
devices  was  utilized.  Wherever  the  cost  analysis  was  to  show  a  minor  increase  in  cost 
to  implement  a  particular  capability  desired  by  a  programmer,  the  feature  was  '  >ded. 

Cost  analysis  information  is  provided  in  Section  n,  Volume  n. 

To  determine  those  associative  memories  that  should  be  studied,  a  mechaniza¬ 
tion  and  configuration  matrix  depicting  possible  configurations  was  developed  (Table  1-1). 
The  range  of  hardware  was  to  be  from  a  basic  minimum  capability  which  can  still  operate 
as  an  AM  through  one  which  represents  a  full  parallel  capability.  The  interface  with  the 
central  processor  represented  a  second  major  variable  to  consider.  By  selecting  systems 
represented  by  dements  of  the  matrix  for  study •  one  could  estimate  the  effect  of  these 
systems  not  covered,  and  the  importance  of  the  feature  for  the  sea  surveillance  problem. 
Time  prevented  a  study  of  all  possibilities;  a  specific  area  which  was  not  studied  was  the 
possibility  of  utilizing  thin  film  for  the  response  store.  The  memories  specifically  in¬ 
vestigated  are: 
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TABLE  1-1.  AM  LOGIC  MECHANIZATION  POSSIBILITIES 


Connection  and 

Configuration 

Possibilities 

Minimum 

Hardware 

Minimum 

Parallel 

Hardware 

Full 

Parallel 

Hardware 

Response  Store 
in  Thin  Film 

! 

Peripheral 

Device 

Buffer 

Channel 

P 

NP 

NP 

NP 

Transfer 

Channel 

P(A1) 

P  (A2) 

P 

P 

Special  IMA 
Channel 

NP 

P 

P  (GAP) 

j 

P 

Multiprocessor 

NP 

NP 

P* 

P* 

Integrated 

NP 

P 

P  (A3) 

P.  j 

;  Special  I/O  or  Search 
Controller 

NP 

NP 

P* 

i 

p* 

Ke£ 


NP:  Not  a  practical  system  (unbalanced). 
P:  A  practical  system  worthy  of  study. 


()  :  The  notations  in  ( )  are  the  designations  of  the  configuration 
studied  in  depth. 

*  :  Indicates  a  configuration  of  especial  promise  and  interest. 


(1)  A1  —  an  AUERBACH  designed  memory  which  represents  a  mini¬ 
mum  hardware  system  interconnected  with  the  CDC  1604-B  via  a 
direct  transfer  channel. 

(2)  A2  —  an  *  UERBACH  designed  memory  which  represents  a  system 
with  minimum  parallel  hardware  interconnected  with  the  CDC  1604-B 
via  a  direct  transfer  channel.  The  A2  memory  contains  eight  tag 
bits  while  no  other  memory  studied  contains  tag  bits. 

(3)  GAP  —  a  Goodyear  Aerospace  Corporation  memory  which  repre¬ 
sents  a  system  with  full  parallel  hardware  interconnected  with  the 
CDC  1604-B  via  a  special  Direct  Memory  Access  (DMA)  channel. 

(4)  A3  —  an  AUERBACH  designed  memory  which  represents  a  system 
with  full  parallel  hardware  fully  integrated  with  the  CDC  1604-B. 

By  fully  integrated,  it  is  meant  that  the  last  2, 048  words  of  CDC 
1604-B  core  have  been  replaced  by  AM  and  can  be  utilize^  both  as 
conventional  core  and  as  an  associative  memory. 

The  AUERBACH  designed  memories  are  feasible  and  are  based  on  the  tech¬ 
nology  developed  by  the  Goodyear  Aerospace  Corporation.  These  memories  are  de¬ 
scribed  in  detail  in  Section  II  of  Volume  D,  and  a  comparison  of  the  configurations  is  given 
in  Section  n  of  this  volume. 

The  basic  elements  of  an  associative  memory  are: 

(1)  The  memory  array  —  which  provides  the  data  storage  itself. 

(2)  Hie  comparand  register  —  which  contains  the  data  to  be  com¬ 
pared  against  the  contents  of  the  memory  array  for  seprches, 
provides  a  shifting  register  for  some  operations,  and  can  play 
an  intermediate  role  in  the  transfer  of  data  between  the  memory 
array  and  the  central  processor. 

(3)  The  mask  register  —  which  is  used  to  contain  data  specifying 
portions  of  words  for  operations  involving  only  word  portions. 
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(4)  The  resolver  —  which  is  used  to  determine  the  location  of 
response  bits  in  the  response  store. 

(5)  The  search  logic  —  which  causes  the  search  qomraands  received 
by  the  memory  to  be  properly  executed. 

(6)  The  response  store  —  which  receives  vectors  indicating  which 
data  satisfy  a  given  search  criterion,  and  which  can  execute 
logical  operations,  such  as  shifting  and  Boolean 
operations,  on  these  vectors. 

In  addition,  an  associative  control  capability  is  provided.  This  capability  is 
Utilized  to  decode  instructions. 

These  elements  are  presented  in  Figure  2-1. 

The  search  logic  and  response  vector's  storage  registers  in  the  Goodyear 
Associative  Processor  count  for  approximately  45  percent  d  the  production  cost  of  the 
GAP.  It  i-  clear  that  large  reductions  can  be  made  in  the  amount  of  hardware  associ¬ 
ated  with  the  response  store.  Since  it  was  one  of  the  objectives  of  this  study  to 
evaluate  both  sophisticated  and  unsophisticated  associative  memory  features,  some 
features  were  expanded  and  others  eliminated  in  the  configurations  studied. 

1. 1. 3.  2  Software  Approach.  The  sea  surveillance  problem  referenced  in  the  ONR 
document  lists  a  small  set  of  queries  that  must  be  answered  (see  Pai  agraph  3.  5,  Sec¬ 
tion  HI,  Volume  II).  One  can  organize  a  system  which  can  handle  the  specific  queries 
listed  in  an  efficient  manner  while  other  queries  are  handled  inefficiently.  Such  an 
approach  could  not  be  taken  for  this  study  since  the  results  obtained  would  be  useless 
for  problems  of  a  similar  nature  to  sea  surveillance.  Indeed,  the  nature  of  intelligence 
problems  is  such  that  if  a  system  were  based  exclusively  on  the  recognized  requirements, 
and  not  generalized,  other  requirements  would  shortly  arise  possibly  obviating  what  was 
achieved  and  requiring  extensive  reprogramming. 

The  approach  taken  was  Ij  design  a  system  that  was  not  biased  towards  any 
specific  queries,  but  organized  to  handle  a  wide  varievy  of  queries  that  the  user  could 
specify.  Thus,  the  user  is  permitted  to  specify  queries  in  which  the  search  criterion 
is  any  logical  condition  which  contains  the  logical  connectives  AND,  OR,  and  NOT  to 
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any  complexity  he  desires.  Within  this  context,  several  of  the  queries  specified  in 
Parag  'aph  3.  5  of  Volume  II  have  been  .  med  both  for  AM  configurations  and  non-AM 
configurations.  Queries  other  than  those  specified  (e.  g. ,  having  arbitrary  logical 
conditions)  can  also  be  effected  utilizing  the  mechanism  developed  in  this  study.  In  addition 
to  queries,  consideration  has  been  given  to  the  problem  of  new  inputs  and  maintenance. 
Maintenance  orders  and  new  inputs  can  be  timed,  if  so  desired. 

To  develop  a  unified  approach  for  the  sea  surveillance  problem  to  be  applic¬ 
able  for  all  configurations  was  a  challenge.  Although  a  unified  approach  was  achieved, 
all. features  of  each  AM  could  not  be  studied  in  detail.  However,  GAP  was  studied  fully. 

A  generalized  file  structure  has  been  developed  which  is  applicable  for  all  configurations. 
The  file  structure  is  biased  towards  AM  processing.  In  addition,  an  executive  control 
routine  which  analyzes  queries  and  handles  them  in  a  general  manner  was  developed  as 
was  a  general,  approach  for  input  processing  and  maintenance.  To  show  the  variation 
in  AM  and  non-AM  configurations,  machine  code  was  developed  for  GAP  and  non-AM 
configurations.  Coding  for  the  variant  AMs  was  not  performed  explicit] v  In  one  instance. 

the  instruction  complement  of  the  AM  (A3)  was  such  that  GAP  coding  was  a  subset  of  the 

\ 

AM  and  its  effect  could  be  estimated;  in  a  second  instance,  the  GAP  was  "simulated"  and 
the  variation  in  timing  noted. 

'V 

1.1.4  Study  Restrictions 

Several  study  restrictions  are  important  to  note  as  they  affect  the  generality 
of  the  results.  The  restrictions  imposed  by  RADC  are  as  follows: 

(1)  Small  Associative  Memories.  The  study  was  restricted  to  include 
only  associative  memories  within  the  state  of  the  art.  Thus,  only 
small  associative  memories  have  been  investigated.  In  particular, 
the  Goodyear  Associative  Processor  was  specified  by  RADC  as 
typical  of  the  technology  of  associative  memories.  The  observations 
and  conclusions  developed,  therefore,  do  not  necessarily  extend  to 
large  associative  memories.  The  hardware  techniques  for  large 
AMs  will  be  considerably  different  than  those  of  GAP. 

(2)  Central  Processor.  The  central  processor  to  be  utilized  as  typical 
was  the  CDC  1604 -B.  It  should  not  be  construed  that  this  is  an  ideal 
central  processor.  No  attempt  was  made  to  deter  mine  an  "ideal’ central 
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processor.  Features  of  the  1604-B  that  proved  cumbersome  are 
noted. 

(*>/  Disc  Un;  .  RADC  selected  the  CDC  818  Disc  for  the  study  since  it 
is  attached  to  the  1604-B  at.  the  RADC  facility.  The  file  structure 
developed  o  focus-in  upon  the  data  was  tailored  towards  the 
pe 'uliarities  of  this  disc  to  avoid  bias.  For  a  different  disc,  a 
modified  file  structure  would  have  to  be  developed. 


1.2  FINDINGS 

This  paragraph  presents  the  study  findings  in  summary  form,  with  brief  ex¬ 
planations.  (Volume  n  provides  ample  supplementary  information  to  support  the  findings. ) 
Where  pertinent,  references  to  Volume  II  are  provided.  The  study  findings  fall  into  three 
broad  areas  of  interest: 

(1)  Hardware  —  findings  with  respect  to  the  nature,  organization, 
utility,  and  value  of  features  of  associative  memories. 

(2)  Programming  —  findings  with  respect  to  using  an  associative  hybrid 
processor  for  the  performance  of  defined  tasks  and  subtasks. 

(3)  Concepts  of  Systems  —  findings  affecting  the  concepts  underlying 
associative  processing,  the  design  of  data  bases,  and  general  ef¬ 
fectiveness  of  AM  processing. 

In  general,  the  findings  pertain  only  to  the  particular  class  of  problem  studied 
and  reported  upon  in  these  volumes.  Extension  of  the  comments  to  other  applications  may 
be  warranted,  but  requires  further  study.  The  more  important  findings  are  noted  in  this 
paragraph  by  capital  letters  and  underscore'*. 

1. 2. 1  Hardware  Findings 

Desirable  and  non-desirable  hardware  findings  were  obtained  as  a  result  of 
designing  and  programming  the  sea  surveillance  system.  One  of  the  most  important  of 
these  findings  concerns  the  interface  provided  for  GAP. 
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THE  EFFECTIVENESS  OF  AN  AM  FOR  PRO 


LEMS  SUCH  AS  SEA 


SURVEILLANCE  IS  HIGHLY  DEPENDENT  UPON  THE  AM  INTER- 


FACE  WITH  THE  DIGITAL  COMPUTER.  THE  A3  WAS  FOUND  TO 


HAVE  THE  BEST  INTERFACE. 


The  sea  surveillance  problem  is  primarily  input- output  limited.  Because 
data  transfers  to  AM  go  through  core,  GAP  was  found  more  time  consuming  for  per¬ 
forming  the  simple  queries  studied  than  a  non-AM  solution;  the  A3  configuration,  which 
was  fully  integrated  with  the  1604-B,  required  the  same  amount  of  time.  In  the  A3 
configuration  the  data  base  is  stored  on  disc.  To  take  full  advantage  of  the  AM  requires 
direct  transfer  of  data  from  disc  to  AM  without  intermediate  core  transfers.  More  com¬ 
plex  queries  would  show  A3  to  advantage  over  the  non- AM  solution  and  GAP.  The  A3 
configuration  was  found  to  have  the  best  interface.  In  the  A3  configuration,  the  associa¬ 
tive  memory  is  fully  integrated  into  the  1604  memo."'  as  shown  in  Figure  1-1.  In  GAP, 
the  associative  memory  is  attached  to  the  direct  mei.  ’v  access  channel,  much  as  one 
would  attach  another  unit  to  the  computer  control.  In  Ax  ?  id  A2,  the  memories  are  at¬ 
tached  to  input-output  Channel  7,  much  as  peripheral  devices  are  attached. 

The  interface  between  GAP  and  the  1604  is  via  a  direct  memory  access  channel 

(DMA). 

THE  USE  OF  THE  DIRECT  MEMtW  ACCESS  CHANNEL  AND  THE 

"MULTIPROCESSING"  CAPABILITY  PROVIDED  FOR  THE  GAP 

HYBRID  CONFIGURATION  HAD  MAJOR  SHORTCOMINGS. 

This  finding  was  contrary  to  some  expectations  and  is  based  upon  the  following: 

(1)  The  overhead  needed  to  seu^,  control,  coordinate,  and  synchronize 
GAP,  because  it  operates  as  a  "multiprocessor"  on  the  DMA,  is 
very  high  (Paragraph  2.  5.  5. 1,  Section  n,  Volume  13). 

(2)  In  some  cases,  contrary  to  programmer  intuition,  it  is  better  to 
code  a  problem  using  inelegant  coding  rather  than  exiting  from  GAP 
coding  to  perform  1604-B  coding  and  returning  to  GAP.  This  arises 
because  of  the  overhead  associated  with  the  DMA  "multiprocessing" 
character  of  GAP.  An  illustration  of  such  coding  is  shown  in 
Paragraphs  7. 2. 1. 2  and  7. 3. 5  of  Volume  n. 
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Arithmetic  Unit 


Figure  1-1.  Place  of  Associative  Memory  in  Internal  A3 
Configuration  of  1604-B 
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(3)  Lack  of  an  interrupt  capability  limits  communication  between  GAP 
and  the  1604-B  and  seriously  limits  GAP  as  a  "multiprocessor"  on 
the  DMA  channel.  An  interrupt  on  Halt  would  improve  GAP- 1604-B 
communications. 

The  cost  of  an  associative  processor  was  an  item  of  major  importance  to  the 
study.  The  most  important  finding  with  respect  to  cost  follows. 

THE  ELEMENTS  OF  COST  OF  AN  ASSOCIATIVE  MEMORY  ARE  THE 

ASSOCIATIVE  MEMORY  ARRAY  AND  ASSOCIATED  ELECTRONICS; 

RESPONSE  STORE;  AND  REGISTERS.  GATES.  AND  CONTROLS.  THE 

RESPONSE  STORE  CAN  BE  VARIED  SIGNIFY ANTLY  TO  DECREASE 

COSTS. 

A  cost  analysis  was  performed  on  each  configuration  studied.  The  relative 
costs  are  shown  in  Figure  1-2.  The  direct  interface  of  A3  with  the  1604-B  and  increased 
instruction  capability  surprisingly  do  not  increase  the  cost  of  the  AM.  Since  all  AM»  studied 
utilize  the  Goodyear  technology,  the  co«t  of  the  basic  array  is  the  same.  Note  that  one 
can  afford  approximately  twice  as  much  AM  storage  capacity  in  A2  for  the  price  of  one 
GAP.  i 

The  capability  to  communicate  results  of  an  operation  to  the  program  coding 
itself  is  the  basis  of  data  dependent  processing.  This  capability  can  be  called  the  intra¬ 
communication  capability  of  a  processor. 

GAP  LACKS  INTRACOMMUNICATION  CAPABILITIES. 

AM-to-AM  transfers,  plus  AM  modification  by  index  registers,  proved  a 
limitation  in  the  data  manipulation.  The  A3  configuration  is  not  so  restricted  nor  is 
A2  (but  A2  is  restricted  to  a  lesser  degree  than  A3).  Data- dependent  processing  as  required 
by  sea  surveillance  type  problems  requires  this  feature  (see  Paragraph  7.3.5.  Volume  II). 


valuable. 


Some  hardware  features  of  the  GAP,  particularly  the  "buffers",  proved 


THE  "P"  AND  "E"  BUFFERS  IN  GAP  PROVIDE  SIGNIFICANT  CAPA¬ 
BILITIES  AND  CAN  BE  GENERALIZED. 


100% 


Memory  Array  and 
Associated  Electronics 

Response  Store 

Registers,  Gates 
and  Control 

Figure  1-2.  Relative  Costs:  GAP,  A3,  A2,  and  A1 
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The  P,  T,  V,  and  Z  fields  of  GAP  instructions  permit  the  programmer  to 
take  advantage  of  the  capability.  Capabilities  in  the  D  and  E  buffers  are  generalized 
in  A3  over  GAP.  The  following  also  pertains: 

(1)  This  capability  proved  less  useful  than  the  capability  of  storing 
all  response  vectors  in  1604  memory,  or  in  tag  bits,  until  needed. 
This  observation  may,  however,  be  the  result  of  the  peculiarities 
of  the  data  base  design  studied. 

(2)  The  capability  to  shift  down  D  and  E  buffers  and  response  store 
is  essential.  The  capability  to  shift  buffers  up  is  desirable,  but 
may  not  be  v  orth  the  cost. 

Hardware  provided  for  A2  only  proved  worthwhile,  especially  in  the  built-in  features  for 
using  the  tag  memory  concept.  The  following  finding  was  supported  by  the  study  of  A2. 

TAG  BITS  ARE  IMPORTANT  AND  ARE  REQUIRED. 

The  absence  of  tag  bits  proved  a  major  limitation  in  GAP  and  A3,  although 
substitutes  could  be  found  in  both  cases.  By  utilizing  the  second  half  of  GAP  as  tag  bits, 
one  achieves  this  result.  Actual  tag  bits  generalized  as  in  A2  are  a  less  costly  solution 
than  wasting  the  entire  second  half  of  GAP  for  a  tag  memory.  On  the  other  hand,  some 
deficiencies  in  the  instruction  repertoire  of  GAP  were  discovered. 

DESIGN  OF  GAP  INSTRUCTIONS  REQUIRES  IMPROVEMENT. 

(1)  Manipulation  of  the  LDR  instruction  was  found  to  be  cumbersome. 
This  instruction  is  utilized  so  frequently  as  to  require  improvement. 

(2)  The  relatively  complicated  searches,  such «s-NHG-(next  higher  than 
comparand)  and  ALC  frext  lower  than  comparand)  proved  to  be  easy 
to  program  and  might  not  be  worth  additional  cost 

(3)  Readout  of  addresses  and  count  of  responders  should  be  inserted  into 
the  desired  1604-B  word  rather  than  fill  the  unused  bits  of  the 
words  with  zeros. 
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1.2.2 


Programming  Findings  ^ 

The  study  of  the  AM  hybrid  configurations  in  solving  the  sea  surveillance 
problem  gave  ample  opportunity  for  collecting  data  about  programming.  This  para¬ 
graph  reviews  the  most  important  findings  about  programming  uncovered  by  Jhe  study. 

The  findings  are  relevant  to  programme]*  learning  experience,  ease  of  programming, 
most  used  and  least  used  features,  most  common  programming  problems*, .software 
tools  required,  and  efficiency  of  AM  hybrid  coding.  No  results  could  be  obtained  on 
the  accuracy  of  coding  since  no  actual  machine  testing  was  possible. 

1. 2.  2. 1  Programmer  Learning  Experience.  Programmers  assigned  to  the  sea 
surveillance  problem  were  experienced  senior  program  designers  familiar  with  a 
wide  variety  of  conventional  and  unconventional  processors.  ALL  PROGRAMMERS 
FOUND  GAP  DIFFICULT  TO  LEARN.  There  appear  to  be  two  major  causes  of  learn¬ 
ing  difficulty.  First,  the  GAP  programming  manual  (Appendix  A,  Volume  II)  is 
deficient.  It  lacks  a  detailed  description  of  the  effect  of  instructions  on  registers  and 
the  store,  and  is  poorly  organized  from  both  a  pedagogical  and  reference  point  of  view.  * 
Second,  the  associative  configurations  are  conceptually  different  from  other  configura¬ 
tions.  All  associative  configurations  studied  could  be  used  in  a  wide  variety  of  ways,  such 
as  tag  memories,  as  "look-up"  machines,  as  logic  machines,  and,  to  a  surprising 
extent,  as  conventional  serial  processors.  Hence,  the  programmer  was  confronted 
with  a  wide  variety  of  concepts  in  a  new  context.  Learning  to  use  these  concepts  in¬ 
volved  considerable  trial  and  error,  and  some  of  the  most  powerful  programming  tools 
were  mastered  relatively  late  in  the  project. 

Although  the  programming  staff  had  full  access  to  the  developers  of  the  Al, 

A2,  and  A3  memories,  it  is  also  true  that  these  memories  proved  difficult  to  learn. 

THE  MAJOR  CAUSE  OF  LEARNING  DIFFICULTY  IS  THEREFORE  THE  STRANGENESS 
OF  ASSOCIATIVE  PROCESSING  TO  PROGRAMMERS.  All  programmers  engaged  in 
the  project  felt  that,  with  more  experience,  they  could  probably  meaningfully  improve 
the  quality  of  their  output 

1. 2. 2. 2  Ease  of  Programming.  Although  each  programmer  has  a  feeling  for  the 
relative  difficulty  of  programming  a  machine  with  which  he  is  familiar,  there  is  no 
one  measure  of  the  relative  ease  or  difficulty  of  programming  a  particular  machine. 

♦This  was  to  be  expected  because  of  time  constraints .  The  programmer  manual  re¬ 
ceive  '  was  a  draft  copy. 
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Some  of  the  important  factors  in  determining  ease  of  programming  include  the  number 
of  instructions  and  storage  locations  affected  by  an  instruction,  the  frequency  of  itera¬ 
tive  loops  required  for  a  task,  the  relative  complexity  of  flowcharts,  and  the  number 
of  fields  that  must  be  specified  for  an  instruction.  There  is  no  convenient  way  to  com¬ 
bine  these  measures  into  one  measure  for  relative  ease  of  programming.  Programmers 
found,  however,  that  certain  tasks  were  easier  to  program  for  an  associative  processor 
than  for  a  conventional  configuration.  A  few  tasks  proved  more  difficult  to  program  for 
an  associative  processor. 

PROGRAMMING  FILE  MAINTENANCE  OPERATIONS  PROVED  EASIER.  BE¬ 
CAUSE  SEQUENCING  OF  DATA  FIELDS  WAS  USUALLY  UNNECESSARY.  Because  the 
associative  memory  makes  addressing  data  by  its  contents  convenient,  the  logically  in¬ 
volved  operations  required  to  locate  a  data  field  in  the  proper  place  within  a  record  were 
greatly  simplified.  The  data  field  value  was  placed  ajacent  to  the  field  definition,  and 
because  of  the  ability  of  an  AM  to  search  the  entire  record  at  one  time,  it  became  unnec¬ 
essary  to  locate  the  field  in  some  particular  place  within  the  record.  Since  order  could 
be  ignored  in  these  cases,  the  programming  required  to  maintain  order  could  be  bypassed. 

Some  tasks  were  proven  not  suitable  for  associative  processing.  These  tasks 
include  tasks  requiring  arithmetic  processing  such  as  calculating  distance  from  geodetic 
position  data,  and  data-dependent  processing.  FOR  ALL  TASKS  EXCEPT  ARITHMETIC 
AND  DATA  DEPENDENT  PROCESSING.  FEWER  INSTRUCTIONS  NEEDED  TO  BE 
WRITTEN  BY  THE  PROGRAMMER  THAN  WERE  REQUIRED  FOR  CONVENTIONAL 
PROCESSING.  This  fact  is  derived  from  some  peculiarities  of  associative  processing. 
FIRST.  FEWER  LOOPS  ARE  REQUIRED.  BECAUSE  OPERATIONS  CAN  BE  PER¬ 
FORMED  IN  PARALLEL.  AND  HENCE  THE  CODING  NEEDED  TO  SET  UP  AND  CON¬ 
TROL  THE  LOOPS  NEED  NOT  BE  WRITTEN.  SECOND.  ONE  CAN  PERFORM  AN 
OPERATION  ON  SEVERAL  DATA  FIELDS  AND  EVEN  SEVERAL  DATA  RECORDS  AT 
THE  SAME  TIME.  Hence,  the  coding  usually  required  to  step  through  fields  and  records 
one  at  a  time  can  be  omitted. 

The  difference  in  the  number  of  instructions  required  to  program  representa¬ 
tive  query  macros  is  shown  in  Figure  1-3.  In  this  figure  the  number  of  instructions 
written  for  GAP  is  compared  to  the  number  of  instructions  written  for  the  same  macros 
for  the  1604- B.  The  reader  must  be  very  careful  about  generalizing  the  data  of  this 
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Figure  1-3.  Number  of  Programming  Instructions  Coded  for  Query  Macros,  GAP  vs.  1604 
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figure.  The  records  processed  by  the  macros  were  designed  in  light  of  the  AM,  and, 
specifically,  order  constraints  which  do  not  exist  for  the  AM  were  ignored  in  specifying 
the  records  to  be  processed.  As  a  result,  1604  coding  to  perform  the  various  macros 
includes  numerous  housekeeping  steps  not  required  when  the  data  records  are  designed 
for  conventional  processors.  *  These  housekeeping  operations  somewhat  inflate  the  figures 
for  the  number  of  instructions  required  for  1604  processing. 

The  relative  infrequency  of  loops  and  the  capability  to  process  several  fields 
and  records  simultaneously  resulted  in  some  differences  of  flowcharting.  FLOW¬ 
CHARTS  REQUIRED  FEWER  FIXED  CONNECTORS.  AND  THEREFORE  APPEARED 
SIMPLER.  ON  THE  OTHER  HAND,  NO  NEW  FLOWCHARTING  CONVENTIONS  WERE 
NEEDED.  Except  for  the  use  of  a  few  new  phrases,  such  as  "set  responders  for  . . . .  " 
or  "AND  response  store  with  the  ....  buffer, "  the  usual  process  boxes  and  decision 
boxes  sufficed. 

The  number  of  fields  required  to  specify  an  instruction  to  GAP  and  to  A3 
seemed  excessive,  although  this  is  perhaps  a  result  of  the  strangeness  of  the  instruc¬ 
tions.  All  programmers  found  it  inconvenient  to  express  S,  N,  and  R  in  an  LDR  in¬ 
struction  for  GAP  before  every  load.  Further,  the  use  of  the  letters  "P, "  "T,  "  "V,  " 
and  "Z"  for  fields  in  GAP  is  certainly  not  mnemonic,  and  may  contribute  to  the  feeling 
that  the  GAP  instructions  are  "too  complex.  " 

Programmers  found  it  difficult  to  determine  the  quality  of  their  coding,  since 
the  relative  efficiency  of  an  algorithm  was  often  contrary  to  intuition.  There  are 
several  causes  of  this  condition.  First,  an  apparently  inefficient  algorithm  can  often 
minimize  transfers  of  control  between  the  GAP  and  1604.  WHENEVER  THE  TRANSFER 
OF  CONTROL  BETWEEN  GAP  AND  THE  1604  lb  MINIMIZED.  THE  EFFICIENCY  IS 
IMPROVED.  Consequently,  it  is  often  best  to  do  a  certain  amount  of  processing  inef¬ 
ficiently  in  GAP  rather  than  suffer  the  penalty  attendant  on  transferring  control.  Second, 
TRANSFERS  OF  DATA  BETWEEN  THE  GAP  AND  THE  1604  ARE  QUITE  COSTLY  IN 
TERMS  OF  TIME,  and  savings  in  time  gained  by  the  parallelism  of  the  GAP  can  be  made 
up  for  several  fold  by  the  need  for  data  transfers. 

♦The  results  of  this  report  are  deficient  in  this  regard,  i.e. ,  it  was  the  expressed  de¬ 
sire  of  RADC  to  optimally  design  the  conventional  system. 
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THE  MOST  SERIOUS  DEFICIENCY  OF  THE  GAP  INSTRUCTION  REPERTOIRE. 
FROM  THE  POINT  OF  VIEW  OF  THE  PROGRAMMER.  IS  THE  ABSENCE  OF  A  FULL 
CAPABILITY  FOR  HANDLING  DATA  DEPENDENT  OPERATIONS.  It  is  usually  neces¬ 
sary  to  use  the  1604  to  set  up  a  coding  sequence  for  GAP,  owing,  in  many  cases,  to 
relatively  minor  faults  in  the  conception  of  GAP.  The  Read  Address  of  Responders  in¬ 
struction  zero  fills  the  word  into  which  the  address  is  read,  making  the  use  of  dummy 
instructions  difficult.  IN  FACT.  IT  IS  SO  COSTLY  TO  ASSEMBLE  INSTRUCTIONS 
USING  GAP  CODE  THAT  THE  PROGRAMMER  IS  MOST  OFTEN  FORCED  TO  USE  THE 
1604  FOR  THIS  PURPOSE. 


In  processing  data,  the  operations  effected  are  often  determined  by  the  data 
itself.  In  conventional  processing,  the  procedure  often  used  is  to  assign  a  counter  to 
each  class  of  data  and  to  test  that  datum  corresponding  to  the  particular  counter's 
value.  The  counter  is  then  advanced.  In  this  scheme,  it  is  possible  to  permit  a  datum 
to  belong  to  more  than  one  class  Now  in  the  GAP,  the  test  for  status  of  a  particular 
class  of  data  determines  the  status  of  all  data  in  the  class  for  each  searchable  (busy  bit 
set)  word.  This  Mass  of  information  often  proved  to  be  useless  since  the  status  of  the 
"first"  responder  was  of  interest.  Then  after  examining  the  first  responder  (and  ignor¬ 
ing  all  others  until  later),  it  was  erased.  Consequently,  the  datum  could  only  belong  to 
one  class.  Additionally,  any  subsequent  test  destroyed  the  status  of  the  first  class  of 
data.  Consequently,  a' reset  cycle  had  to  be  performed.  CLEARLY.  THE  GOODYEAR 
ASSOCIATIVE  -PROCESSOR  OFFERS  NO  ADVANTAGE  IN  EFFECTING  DATA  DEPEND¬ 
ENT  PROCESSING. 

j  1  2  Most  Used  and  Least  Used  Features.  THE  MOST  USED  INSTRUCTIONS 
PROVED  TO  BE  THE  SIMPLE  SEARCHES  AND  THE  LOAD  AND  READ  INSTRUCTIONS. 
THE  LEAST  USED  INSTRUCTIONS  WERE  THE  MORE  COMPLEX  SEARCHES.  The 
simple  searches  almost  always  sufficed  for  establishing  response  vectors  used  in  proc¬ 
essing,  and  the  more  complex  searches,  such  as  next  higher/lower  than  comparand,  were 
not  only  seldom  used,  but  could  be  easily  programmed  in  any  case.  The  complex  search, 
"CPX, "  was  not  used  in  the  study.  CPX  will  prove  valuable  when  many  fields  are  con¬ 
tained  in  one  word.  In  this  system  only  one  major  field  occurs  in  a  word  in  most  cases. 
THE  LDR  INSTRUCTION  WAS  OFTEN  AN  INCONVENIENCE,  because  it  is  required  for 
the  specification  of  fields  about  which  the  programmer  usually  does  not  care.  For  the 
vast  majority  of  cases,  S  =  0  in  the  LDR,  and  R  and  N  need  not  be  specified,  because  the 
operation  is  to  range  over  a  half  memory  or  the  entire  memory. 
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THE  NEXT  HIGHER/LOWER  THAN  COMPARAND,  MAX,  MIN,  AND  OTHER 
COMPLEX  SEARCHES  WERE  LUXURIES.  These  searches  could  be  programmed  quite 
easily,  and  were  seldom  used  in  the  problem  at  hand. 

1.  2.  2.4  Most  Common  Programming  Problems  and  Solutions.  Certain  problems  were 
encountered  so  frequently  in  dealing  with  sea  surve  llance  problems  that  standardized 
solutions  were  developed  for  them.  Two  of  the  problems  fell  into  a  class  which  required 
solutions  for  this  specific  problem,  while  a  third  class  of  problems  seems  to  cast  light 
on  features  desirable  in  compilers  for  associative  processors. 

A  recurring  problem  for  programmers  using  the  hybrid  configurations  derives 
from  the  relative  size  of  th  AM.  In  those  cases  in  which  the  data  to  be  processed  ex¬ 
ceeds  the  storage  of  the  AM,  the  programmer  must  repeat  operations  on  different 
Segments  of  the  data.  The  problem  is  not  trivial,  since  the  number  of  times  the  AM 

must  be  loaded  and  the  order  i.i  which  operations  are  to  be  performed  are  variable.  For 

\ 

example,  if  it  would  require  only  two  memory  loads  to  enter  a  list  into  AM,  but  the  pro¬ 
grammer  requires  that  a  Max  search  be  performed  on  the  list  as  if  it  were  one  memory  load, 
then  three  separate  memory  loads  will  be  required  to  perform  the  simulation  of  a  larger 
memory.  If,  however,  a  search  on  equality  (EMC)  is  required,  only  two  loads  will  be 
required.  The  problem  was  solved  by  developing  a  routine  that  functions  in  the  manner 
of  certain  executive  routines;, the  routine  1  called  the  AM  Dispatcher.  It  receives  com¬ 
mands  from  a  program  using  the  AM,  and  successively  performs  operations  for  the 
program  as  if  a  longer  AM  were  available.  AN  EXECUTIVE  FUNCTIONAL  ROUTINE 
WHICH  WILL  OVERLAY  AM  STORAGE  FOR  A  PROGRAM  WILL  NO  DOUBT  PROVE 
DESIRABLE  FOR  MANY  OTHER  PROBLEMS,  AS  IT  HAS  FOR  SEA  SURVEILLANC E. 

I  * II.  Ml...  1.  ■  ■>■111.1  1 1.;.  i  ■  ^  ■■  .11.— ..»■■■■■  iii.imi  —  ■■» -  —  ■  I  .—  ..M.  .  n.  . . . —  i.— . 

A  second  class  of  problems  is  derived  from  the  peculiarities  of  GAP.  Unless 
one  is  using  GAP  as  a  Tag  Memory,  it  is  surprisingly  difficult  to  subset  the  data  in  the 
AM  in  order  to  perform  a  sequence  of  operations  on  that  subset  alone.  This  class  of 
problems  can  be  readily  solved  with  a  non-trivial  macro,  SPO.  MACROS  LIKE  SPO, 

WHICH  SERVE  THE  PURPQ6E  CF  TEMPORARILY  SUBSETTING  THE  DATA  IN  THE  AM, 


WILL  UNDOUBTEDLY  PROVE  OF  VALUE  IN  MANY  GAP  SYSTEMS  AS  SPO  HAS  IN  THIS 


Finally,  it  is  worth  stating  that  the  study  showed  some  operations  to  be 
so  frequent  and  of  such  general  utility,  as  to  perhaps  influence  the  specification  of 
compilers  for  hybrid  configurations. 

The  functions  most  used  are  listed  below : 

(1)  Identify  all  x  (i.  e.  ,  turn  on  responders  for  all  entries  identical 
to  some  masked  portion  of  a  field). 

(2)  Fetch  from  AM  all  those  data  satisfying  a  search  criterion. 

(3)  Erase  all  entries  satisfying  criteria  x. 

(4)  Substitute  x  for  y  in  all  responders. 

1.  2.  2. 5  Program  Efficiency.  To  say  that  program  A  is  more  efficient  than  program  B 
in  performing  a  certain  task  is  to  say  that  program  A  will  perform  that  task  more  econ¬ 
omically.  When  input  and  output  specifications  are  the  same  for  both  programs,  and 
whan  the  computer  systems  are  of  equal  cost  for  program  A  and  program  B,  efficiency 
is  a  function  of  processing  time.  As  is  well-known,  some  problems  on  some  computer 
systems  can  be  evaluated  for  efficiency  as  a  simple  function  of  the  time  required  to 
input  and/or  output  the  data  required.  Such  problem  configuration  systems  are  said  to 
be  "input-output  bound.  " 

When  the  sea  surveillance  problem  is  processed  with  the  proposed  data  base 
stored  on  the  8 18  Disc ,  THE  SYSTEM  IS  INPUT-OUTPUT  BOUND.  WHETHER  AN  AS¬ 
SOCIATIVE  MEMORY  IS  AVAILABLE  OR  NOT  The  time  to  perform  a  task  upon  the 
specified  system  is  so  nearly  a  function  of  the  time  required  to  input  and  output  disc 
data  that  the  relative  efficiency  of  the  AM  and  non-AM  configuration  will  be  very  nearly 
the  same.  It  is  therefore  necessary  to  define  efficiency  independent  of  the  818  Disc. 

Such  a  definition  is  not  easily  obtained.  The  oresent  system  was  designed  to  optimize 
processing  of  data  stored  on  the  818  Disc.  To  pretend  to  ignore  the  disc  system's  in¬ 
fluence  upon  the  actual  system  efficiency  is  therefore  misleading. 


Keeping  the  influence  of  the  disc  in  mind,  it  is  otill  possible  to  get  informa¬ 
tion  about  relative  efficiency  of  an  AM  augmented  system  in  contrast  to  one  without  the 
AM.  It  is  convenient  to  begin  an  inquiry  about  efficiency  by  noting  that  it  is  possible 
for  an  AM  itself  to  be  penalized  by  input-output. 

It  is  perfectly  proper  to  speak  of  an  associative  memory  as  input-output  bound 
if  the  processing  time  is  very  nearly  a  function  of  the  time  required  to  load  and/or  un¬ 
load  the  associative  memory.  This  is  true  whether  the  device  is  a  proper  peripheral 
device  (A2),  a  device  on  the  DMA  channel  (GAP),  or  a  device  integrated  into  memory 
but  still  perhaps  requiring  memory- to- memory  transfers  (A3). 

The  sea  surveillance  problem  is  such  that,  with  the  exception  of  a  few  functions 
in  file  maintenance,  the  time  required  to  load  and/or  unload  the  AM  is  a  very  good  indica¬ 
tor  of  the  processing  involved.  FOR  THE  SEA  SURVEILLANCE  PROBLEM,  WITH  THE 
EXCEPTION  OF  FILE  MAINTENANC E  AND  QUERY  PROCESSING  OPERATIONS,  PROC¬ 
ESSING  TIME  IS  NEARLY  A  FUNCTION  OF  LOADING  TIME  FOR  GAP  AND  A2. 

Ironically,  the  situation  for  GAP  and  A2  is  worse  than  being  input-output  bound 
for  this  problem.  LOADING  AND  UNLOADING  GAP  AND  A2  CANNOT  BE  OVERLAPPED 
WITH  PROCESSING.  HENCE,  GAP  AND  A2  ARE  SEVERELY  PENALIZED  BY  LOADING, 
RATHER  THAN  BEING  BOUND  BY  IT.  Processing  time,  which  is  relatively  small,  must 
be  added  to  load  and  unload  time. 

Figure  1-4  graphically  shows  the  effect  of  load  time  for  GAP.  The  solid  line 
shows  the  total  amount  of  processing  time  per  iteration  without  using  an  AM  and  assuming 
the  data  is  in  core;  the  broken  line  shows  the  total  processing  time,  per  iteration,  for 
GAP,  assuming  that  the  data  remains  in  AM  for  each  successive  iteration.  The  routine 
in  question  is  "REL"  and  it  is  perfectly  reasonable  to  assume  that  the  routine  would  be 
repeatedly  used  to  apply  successive  relational  tests  to  elements  in  a  list. 

THE  FIGURE  SHOWS  CONCLUSIVELY  THAT  PROCESSING  TIME  PER  ITERA¬ 
TION  DECREASES  IF  THE  DATA  REMAINS  IN  AM.  This  fact  is  a  simple  consequence  of 
prorating  the  load/unload  penalty  over  iterations.  Further,  it  should  be  obvious  that  this 
fact  can  be  generalized,  since  it  should  always  be  correct  to  prorate  the  load/unload 
penalty  over  successive  Iterations. 


In  Figure  1-4  the  dotted  and  solid  lines  cross.  This  point  can  be  called 
the  iteration  breakpoint.  For  almost  til  query  functions  over  1,000  data  words,  an 
iteration  hr  -ak*.  "nt  -ts.  At  the  iteration  breakpoint  for  the  given  data  volume,  it 
is  more  efficient  to  use  an  AM  for  query  processing  than  to  use  the  conventional  pro¬ 
cessor.  It  should  be  noted,  however,  that  only  relatively  complex  queries  will  re¬ 
quire  enough  iter,  ions  over  ♦’  data  to  "how  the  AM  to  advantage. 

The  above  concius-or  ts  iu-~f'  er  orroborated  by  Figure  1-5,  which  shows 
the  effect  of  prorating  the  load/  un’.oau  penalty  over  successive  iterations  for  three 
other  query  processing  routines. 

IN  THE  SEA  S  UR  VEIL  LA  NTCE  SYSTEM.  THE  QUERIES  WERE  RELATIVELY 
SIMPLE.  HENCE.  THE  GAP  AND  A2  MEMORIES  ARE  LESS  EFFICIENT  THAN  A  CON- 
VENTIONAL  MEMORY  SYSTEM.  ON  THE  OTHER  HAND.  IT  IS  EVIDENT  THAT  QUERIES 


CAN  BE  FORMULATED  WHICH  ARE  BETTER  PERFORMED  IN  THE  AM  THAN 
CONVENTIONALLY. 


This  fact  is  a  simple  consequence  of  the  existence  of  iteration  breakpoints,  and 
the  legitimacy  of  prorating  the  load/unload  penalty  over  successively  iterated  AM 
operations. 

The  entire  problem  of  the  load/unload  penalty  arises  because  of  the  way  in  which 
A2  and  GAP  are  configured.  For  both  A2  and  GAP,  data  input  to  the  system  enters  the 
AM  after  a  brief  pause  in  the  1604  core.  The  routing  of  data  through  the  1604  is  a  result 
of  the  ability  of  either  the  DMA  channel  or  Channel  7  to  communicate  directly  with  an 
input-output  channel.  Because  the  data  must  be  routed  through  the  1604  and  then  loaded 
into  the  FM,  the  load/ unload  penalty  exists. 

Configuration  A3  does  not  require  that  data  be  routed  through  the  1604.  Data 
can  be  read  directly  into  the  A3  from  the  peripheral  device,  with  only  a  small  penalty  due 
to  the  difference  of  the  cycle  times  bejween  core  and  the  AM  memory.  A3  WILL  PROVE 
TO  BE  ADVANTAGEOUS  FOR  QUERY  PROCESSING  WITH  LESS  COMPLY.-:  QUERIES 
THAN  REQUIRED  FOR  A2  AND  GAP.  HENCE.  THE  A3  CONFIGURATi  ■’!  S  "MORE 
EFFICIENT"  THAN  EITHER  GAP  OR  A2  FOR  PROBLEMS  OF  THIS  KIND. 
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Figure  1-5.  Iteration  Graphs  for  Three  Query  Subroutines,  TIME,  EQU,  and  AND 


The  time  required  to  execute  the  simple  questions  of  sea  surveillance,  through 
query  procesing  alone,  is  shown  in  Figure  1-6.  Queries  of  sufficient  complexity  to  show 
GAP  and  A3  to  advantage  were  not  invented  or  timed  as  part  of  the  study. 

It  should  be  noted,  however,  that  the  full  realization  of  the  A3  potential 
for  minimizing  the  load/unload  penalty  of  GAP  and  A2  requires  the  development  of 
a  relatively  sophisticated  input-output  routine,  in  order  to  transfer  data  directly 
between  the  disc  and  AM. 

GAP,  as  distinct  from  both  A2  and  A3,  suffers  one  other  penalty  besides 
the  load/unload.  This  penalty  may  be  called  the  transfer  penalty,  which  is  paid  in 
the  microseconds  required  each  time  GAP  is  to  begin  processing  a  particular  sequence 
of  GAP  coding  and  in  the  microseconds  required  to  test  the  condition  of  the  AM  when 
a  coding  sequence  is  complete.  Compared  to  the  load/unload  penalty,  this  penalty  is 
minor,  and  is  largely  alleviated  by  a  minor  change  recommended  for  GAP,  the  inter¬ 
rupt  on  halt. 

1.  2.  3  Concept  Findings 

The  availability  of  an  associative  memory  changes  the  thinking  of  systems 
designers,  and  suggests  alternative  solutions  to  problems.  The  ability  of  such  a 
device  to  stimulate  the  exploration  of  different  ideas  is  perhaps  its  most  important 
contribution  to  the  field.  Unfortunately,  only  a  few  of  the  various  ideas  suggested  by 
the  memory  could  be  explored  in  a  study  of  this  kind.  Avenues  of  further  research 
thought  to  be  of  promise  are  presented  in  Paragraph  1.  3  of  this  volume.  This  para¬ 
graph  li6‘s  some  of  the  conceptual  findings  actually  used  in  the  course  of  the  study. 

The  order  and  location  of  data  within  records  are  usually  two  of  the  important 
keys  to  processing  and  understanding  the  meaning  of  each  item  of  data.  The  associa¬ 
tive  memory  has  the  capability  to  process  data  without  regard  to  the  relative  location 
of  a  datum  within  a  record,  by  referencing  the  datum  in  terms  of  its  contents  or,  in 
the  case  of  the  data  base  actually  used,  in  terms  of  the  name  of  the  datum.  THF  DE¬ 
SIGNER  OF  THE  DATA  RECORD  WAS  FREED  FROM  CONSIDERATIONS  OF  ORDER 
AND  RELATIVE  LOCATION  WITHIN  RECORDS. 


1-29 


QUE8TION 

GAP 

1604-B 

A3 

(best  case  only) 

How  many  ships  are 
within  r  miles  of 
point  p  ? 

361.3 

357.9 

358.7 

Are  any  U.  S.  submarines 
in  area  •> 

372.9 

365.5 

367.2 

How  soon  can  DU789 
reach  Bermuda  under 
normal  SOA  ? 

28.6 

17.9 

23.4 

VTiere  is  Admiral 
now'/ 

10.9 

7.0 

8.3 

Are  any  ships  scheduled 
to  be  in  the  pr  *ectpd 
vicinity  of  Typhoon  juotti* 
in  the  next  few  da  vs? 

361.3 

i  -  -  . . . -  . 

357.9 

358.7 

Aircraft  or  ship,  *ith 
doctor  aboard,  nearer* 
in  time  to  point  x.  y  ' 

i 

1 

i  118.7 

l 

97.4 

100.0 

What  ships  with  long- 
range  radar  could  be  in 
area  by  2400 

tomorrow? 

’15.9 

| 

1 

90.0 

/ 

97.7 

Figure  1-6.  Timing  of  Representative  Sea  Surveillance  Questions 
(mill iseco ids  query  processing  only). 


The  availability  of  the  associative  memory  had  little  effect  on  the  overall 
design  of  the  data  files,  even  though  it  had  a  significant  effect  on  the  design  of  data 
records.  BECAUSE  THE  DISC  STORAGE  UNIT  SELECTED  FOR  FILE  STORAGE 
WAS  RELATIVELY  SLOW.  AND  IMPOSED  FORMATTING  CONSTRAINTS.  THE  AVAIL¬ 
ABILITY  OF  THE  AM  WAS  LESS  SIGNIFICANT  IN  THE  OVERALL  DESIGN  OF  FILES 
THAN  WERE  THE  PECULIARITIES  OF  THE  818  DISC.  An  AM  can,  however,  be  ex¬ 
pected  to  influence  overall  file  design  in  other  situations,  and  a  research  project  in 
this  area  is  recommended  in  Paragraph  1.  3. 

The  language  of  programmers  is  changed  by  the  availability  of  an  AM.  Terms 
like  "response  vectors,  "  "tag  memory,  "  and  the  other  words  and  phrases  of  the  AM 
technology  become  current.  Surprisingly,  no  basic  changes  are  made  in  the  program¬ 
mer's  idea  of  bow  to  make  flowcharts  and  how  to  organize  algorithms.  THE  EXIST¬ 
ENCE  OF  THE  AM  DID  NOT  REQUIRE  CHANGES  TO  FLOWCHARTING  CONVENTIONS 
Oil  TO  THE  STATEMENT  OF  ALGORITHMS.  Programmers  used  flowcharting  symbols 
and  various  techniques  according  to  their  own  preference,  with  no  noticeable  need  to 
change  these  basic  tools. 

Because  implementation  experience  is  required  to  test  our  ideas  about  associa¬ 
tive  processing,  problems  should  be  implemented  for  the  GAP  hybrid  as  soon  as  it  be¬ 
comes  available.  The  SEA  SURVEILLANCE  PROBLEM  IS.  HOWEVER.  TOO  LARGE 
FOR  A  FIRST  CUT  AT  IMPLEMENTING  AN  ASSOCIATIVE  PROCESSOR  PROBLEM. 

Some  manageable  subproblem  of  potentially  high  yield  could  be  selected  for  implementa¬ 
tion,  however. 

1.3  CONCLUSIONS  AND  RECOMMENDATIONS 

1.3.1  Introduction 

The  purpose  of  this  paragraph  is  to  present  certain  conclusions  and  recom¬ 
mendations  in  a  summary  form.  These  conclusions  are  noted  in  capital  letters  and 
underscored.  Volume  H  and  other  paragraphs  of  this  volume  provide  supporting  in¬ 
formation  for  these  conclusions  and  recommendations.  The  subject  items  of  this 
paragraph  will  be  considered  from  these  viewpoints: 

(1)  The  sea  surveillance  problem  and  the  hybrid  associative  memory 
system. 
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(2  rhf.  Goodyear  Associative  Processor. 

(i  '  •  a  .*e  Efforts. 

1. 3. 2  The  Sea  Surveillance  Problem  and  the  Hybrid  Associative  Configuration 
By  definition,  there  are  three  main  components  in  a  hybrid  associative 


memory  system: 

{D 

The  central  processor. 

(2) 

The  associative  memory. 

(3) 

Peripheral  storage  devices. 

These  three  components  may  be  considered  as  axes  in  an  overall  system;  that  is, 
every  component  may  be  varied  to  some  extent.  A  convenient  way  of  showing  how 
these  may  be  varied  is  to  develop  a  recommended  hybrid  system  for  the  sea  surveil¬ 
lance  problem. 

The  sea  surveillance  problem  is  a  problem  which  requires  the  manipulation 
of  a  large  data  base.  The  volume  of  data  requires  the  use  of  a  peripheral  storage 
device;  the  order  of  data  required  demands  the  random  access  features  of  either  a  drum 
or  a  disc.  From  a  programmer's  viewpoint,  the  choice  of  drum  or  disc  is  important 
if  the  data  processing  required  must  wait  for  data  to  be  accessed  and  stored  in  the  proc¬ 
essor.  What  is  desired  is  a  system  balanced  timewise  between  data  access  time  and 
processing  time.  In  this  respect,  a  disc  file  similar  to  THE  IBM  1302  DISK  UNIT  IS 
RECOMMENDED  for  problems  of  the  sea  surveillance  type. 

The  selection  of  a  central  processor  may  be  made  >n  an  evolutionary  fashion, 
that  is,  by  first  supposing  that  the  associative  memory  did  not  exist.  (The  later  addi¬ 
tion  of  the  associative  memory  processor  will  not  detract  from  this  initial  selection 
of  the  central  processor. )  Because  of  the  voluminous  amounts  of  data  manipulation 
required,  THE  BEST  CENTRAL  PROCESSOR  FOR  THE  SEA  SURVEILLANCE  PROB¬ 
LEM  HAS  THESE  FEATURES: 

(1)  GATHER  READ,  SCATTER  WRITE  INSTRUCTIONS. 

(2)  CQRE-TO-CORE  MOVE  OF  BLOCK  SIZED  DATA. 
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Notice  that  this  does  not  include  the  CDC  1604-B.  Therefore,  THE  CPC  1604-B  IS 
NOT  THE  BEST  CENTRAL  PROCESSOR  that  could  be  chosen.  ‘ 

The  selection  of  an  associative  meraoiy  requires: 

(1)  A  decision  to  determine  whether  the  associative  memory  will  be 
beneficial. 

(2)  A  determination  of  the  required  features  of  this  memory. 

The  first  requirement  may  be  answered  in  the  affirmative;  that  is,  AN  AS¬ 
SOCIATIVE  MEMORY  PROCESSOR  IS  AN  ASSET  IN  EFFECTING  problems  of  the  sea 
surveillance  type.  This  follows  from  the  decrease  in  programming  analysis  and  en¬ 
coding  required  because  of  die  relaxed  data  format  and  ordering  constraints  needed  in 
a  hybrid  system.  The  choice  of  the  associative  processor  depends  upon  its  interface 
features  with  the  central  processor  and  its  ability  to  manipulate  data.  In  this  respect, 
THE  GOODYEAR  ASSOCIATIVE  PROCESSOR  SHOULD  NOT  BE  USED  (IF  A  CHOICE 
MAY  BE  MADE)  FOR  PROBLEMS  OF  THE  SEA  SURVEILLANCE  TYPE,  for  these 
reasons: 

(1)  Difficulty  in  data- dependent  operations. 

(2)  Lack  of  interrupt  on  halt  i.  .truction, 

(3)  Cannot  take  advantage  (because  of  the  preceding  two  reasons) 
of  multiprocessing  aspect. 

(4)  Special  channel  not  warranted  —  adds  to  cost. 

(5)  Sophistication  (for  example,  luxury  search  instructions)  not 
required. 

The  best  associative  processor  is  not  described  explicitly  in  this  report. 

It  is  one  processor  using  the  interface  method  of  another  processor;  that  is,  THE 
BEST  ASSOCIATIVE  MEMORY  PROCESSOR  FOR  PROBLEMS  OF  THE  SEA  SUR¬ 
VEILLANCE  TYPE  HAS  THE  FOLLOWING  FEATURES  OF  THE  A2  PROCESSOR  AND 
THE  INTERFACE  OF  THE  A3  PROCESSOR: 

(1)  TAG  BITS. 

(2)  RESPONSE  STORE  MANIPULATION. 

(3)  PREFIX  REGISTER. 


1-33 


In  comparing  two  systems  which  differ  in  that  one  has  an  associative 

memory  and  one  does  not,  it  may  result  that  processing  time  requirements  will  be 

of  the  same  magnitude.  In  this  study,  it  was  found  that  the  GAP  hybrid  system  was 

indeed  slower  by  about  5-10  percent  than  the  non-hybrid  system.  This  result  was 

? 

generated  by  the  fact  that  the  interface  ai  rangement  required  that  data  be  trans¬ 
ferred  between  the  disc  and  the  associative  processor  through  the  1604-B  memory. 

If  the  AS  interface  arrangement  is  employed,  the  time  requirements  will  not  be 
improved  significantly  or.  an  overall  processing  basis  since  the  data  access  times 
remain  constant.  Thus,  a  decision  of  choice  between  the  two  systems  should  not  be 
based  on  time  requirements  alone.  Even  at  the  sacrifice  of  some  processing  time, 
it  was  found  that  EVERY  HYBRID  SYSTEM  STUDIED  RESULTED  IN  THESE  AD¬ 
VANTAGES: 

(1)  REDUCES  PROGRAMMING  ANALYSIS  AND  ENCODING  WITH 
RESPECT  TO  DATA  ORDERING. 

(2)  ADMITS  POWERFUL  POTENTIAL  IN  PROCESSING  QUERIES. 

(3)  PERMITS  SIMPLIFICATION  OF  DATA  BASE  MAINTENANCE 
PROCESSING. 

It  should  be  remembered  that  the  sea  surveillance  problem  used  in  this 
study  is  not  the  operational  sea  surveillance  problem,  because  of  security  restric¬ 
tions;  the  problem  considered  may  bear  little  resemblance  to  the  real  problem. 
Additionally,  this  study  has  designed  a  system  that  is  biased  towards  the  hybrid  sys¬ 
tem  composed  of  the  CDC  1604-B  central  processor,  the  CDO  818  Disc  File,  and  the 
Goodyear  Associative  Processor.  For  these  reasons,  IT  IS  NOT  CERTAIN  THAT  AN 
ASSOCIATIVE  PROCESSOR  SHOULD  BE  USED  FOR  THE  OPERATIONAL  SEA  SUR¬ 
VEILLANCE  PROBLEM. 

After  many  systems  are  designed,  a  natural  evolutionary  process  is  to  en¬ 
code  these  systems  to  obtain  empirical  results.  It  is  felt  that  the  system  developed 
in  this  report  is  much  too  large  for  such  an  undertaking.  Therefore,  IT  IS  NOT  REC¬ 
OMMENDED  THAT  THE  ENTIRE  SEA  SURVEILLANCE  SYSTEM  GIVEN  IN  THIS  RE¬ 
PORT  BE  ENCODED  TO  ACHIEVE  EMPIRICAL  RESULTS.  On  the  other  hand, 
system  reported  on  contains  many  interesting  subsystems  which  are  also  useful  in 
other  applications.  For  example,  the  query  language  translator  (Paragraph  7.  2. 1, 
Volume  II)  consists,  in  the  main,  of  processing  which  converts  a  query  language 


statement  into  a  canonical  Polish  prefix  form;  the  query  pre-processor  accepts  this 
Polish  prefix  form  and  "compiles"  it  into  a  command  list;  the  query  run  routine 
accepts  the  command  list  as  input  and  performs  it  in  an  interpretive  fashion.  These 
subsystems  in  themselves  arc  applicable  to  problems  other  than  sea  surveillance. 
Therefore,  IT  IS  RECOMMENDED  THAT  SOME  SUBSYSTEM  BE  ENCODED  FOH  THE 
CPC  818  DISC  FILE.  CPC  1604-B  CENTRAL  PROCESSOR.  AND  THE  GAP  TO  OBTAIN 
EMPIRICAL  RESULTS. 

1.3.3  The  Goodyear  Associative  Processor 

Before  any  encoding  is  started,  IT  IS  STRONGLY  RECOMMENDED  THAT 
A  COMPREHENSIVE  PEPAGOGICALLY  SOUND  PROGRAMMING  MANTJAL  BE  WRITTEN 
FOR  THE  GOODYEAR  ASSOCIATIVE  PROCESSOR.  This  manual  should  contain  at  a  nvni- 
mum  those  items  noted  in  Paragraph  3.  3.  2. 

It  is  perhaps  too  late  in  the  development  of  the  GAP  to  make  many  (if  any) 
changes  in  the  logic.  However,  several  such  changes  may  be  recommended  for  the 
first  and/or  future  processors.  These  changes  arc  listed  in  a  utility  order: 

IT  IS  RECOMMENDED  THAT  THE  GAP  BE  MODIFIED  TO: 

*  PROVIDE  AN  INTERRUPT  TO  THE  1604-B  A1  TER  EXECUTING 
THE  HALT  INF"  RUCTION. 

*  INSERT  THE  ADDRESSES  AND  COUNT  OF  RESPONDERS  INTO  A 
1604-B  CORE  MEMORY  WITHOUT  ZERO  FILLING. 

*  PROVIDE  THE  ABILITY  TO  READ  DIRECTLY  FROM  PERIPHERAL 
STORAGE  INTO  GAP. 

*  PROVIDE  INDIRECT  ADDRESSING  AND/OR  INDEX  MODIFICATION 
OF  ASSOCIATIVE  MEMORY  A  DDR 

*  PERMIT  READ/WRITE  OPERATIONS  Oi  ELEMENTS  OF  THE  D 
AND  E  REGISTERS. 

The  first  two  modifications  are  most  important.  If  effected,  such  changes  would  onhano 
the  multiprocessing  features  of  the  system  by  permitting  effective  control  (the  Halt  in¬ 
struction  —  interrupt  change)  r_nd  removing  a  need  for  intermediate  1G04-B  operations 
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to  form  subsequent  instructions  to  load  (or  unload)  the  memory>  determine  data  flow, 
etc.  The  remaining  changes  would  enhance  the  GAP  if  they  were  effected  but  these 
are  not  essential  changes.  It  should  be  noted  that  the  first  recommended  change  is 
the  most  important,  the  easiest  to  effect*,  and  the  least  expensive. 

13.4  Future  Studies 

Throughout  the  course  of  this  study,  several  questions  arose  which  could  not 
be  answered  directly,  if  at  all,  because  of  time  requirements.  These  questions  were 
of  such  a  complex  nature  that  the  answers  desired  warranted  a  separate  study  of  their 
own.  These  questions  then  are  the  genesis  for  recommended  future  studies.  The 
following  paragraphs  have  attempted  to  form  each  study  as  an  independent  investiga¬ 
tion. 

1.  3.  4. 1  Other  File  Structures.  The  current  study  focused  attention  on  one  particular 
file  structure:  a  multi-list  file  structure  in  which  the  links  usuaHy  contained  in  data 
record:-  are  extracted  and  placed  in  directories.  Other  file  structures  should  be  in- 
'•estig;  ‘  ad  to  determine  the  effectiveness  of  AMs  for  different  file  structure  types. 

Some  examples  of  file  structures  are: 

(1)  Fixed  field,  fixed  size  data  records. 

(2)  Variable  field,  variable  record  size. 

(3)  Fixed  field,  variable  record  size. 

(4)  Files  with  records  embedded  within  records. 

For  each  type  of  file  structure  the  problems  of  query,  maintenance,  and  data  bast* 
loading  should  be  investigated.  The  effectiveness  of  an  AM  for  problems  with  such 
files  could  be  assessed,  and  the  importance  of  the  AM  interface  and  guidelines  could  be 
determined  to  indicate  when  an  AM  approach  would  improve  the  data  processing  for 
each  file  type.  A  trade-off  study  should  be  conducted  to  indicate  the  point  where  the 
AM  utility  is  important.  For  example,  if  record  sizes  for  a  fixed  field,  fixed  size 


•A  possible  method  for  accomplishing  this  change  is  to  interchange  the  interrupt 
caused  by  the  parity  error  'ight  and  the  resetting  of  the  AM  active  line  by  the 
Halt  instruction.  This  may  be  as  simple  as  the  interchanging  of  two  wires. 
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data  record  were  such  that  only  one  data  record  could  be  placed  into  an  AM,  then 
the  AM  would  not  be  an  effective  device  for  such  problems  since  a  fixed  record 
obviates  the  "associative”  need  of  the  AM.  On  the  other  hand,  if  the  record 
data  description  were  placed  in  AM,  the  programs  could  be  written  in  an  inter¬ 
pretive  fashion  and  querying  the  AM  would  isolate  the  precise  location  of  the 
data  in  the  record. 

Investigation  or  implementation  of  a  comprehensive  system  would  not  be 
appropriate  for  this  type  of  study.  More  would  be  accomplished  by  developing  test 
problems  and  producing  careful  timing  analyses.  The  study  should  analyze  the  variant 
AMs  investigated  during  this  current  effort.  If  one  AM  is  considered,  it  should  be 
either  the  GAP  or,  preferably,  the  A2/A3  configuration.  The  advantage  of  choosing 
test  problems  judiciously  is  that  they  can  actually  be  implemented  in  an  AM  without 
great  expense.  Much  will  be  learned  about  AM  techniques  when  ore  actually  exists 
and  is  utilized  for  experimentation.  Depending  upon  the  results  obtained  from  such 
a  study,  a  specific  application  of  a  data  base  of  importance  to  intelligence  could  be 
implemented.  This  would,  however,  be  a  second  phase  to  this  study  rather  than  a 
first  phase.  A  RECOMMENDED  STUDY  IS  TO  EXAMINE  THE  IMPACT  OF  AN  AS¬ 
SOCIATIVE  MEMORY  (EITHER  THE  GAP  AND/OR  THE  \2  LOGIC  WITH  THE  A3 
INTERFACE)  UPON  THE  NOTED  FILE  STRUCTURES. 

1.3.4.  2  Executive  Control  Programs,  in  this  study,  several  executive  type  pro¬ 
grams  were  required.  For  example,  the  query  controller  and  query  input  function  to 
control  the  allocation  and  input  of  data  from  the  disc  file  into  the  cere  memory;  the 
AM  Dispatcher  determines  and  effects  the  segmentation  and  processing  control  of  a 
functional  subroutine  and  its  inputs,  outputs,  etc.  In  all  cases,  data  was  transferred 
from  the  disc  file  into  the  associative  memory  through  the  1604-B  core  memory.  As 
noted  in  this  study,  to  take  advantage  of  the  A3  configuration  a  centralized  program 
must  control  input-output  fro  core  to  disc  so  that,  under  instructions  from  the  pro¬ 
grammer,  data  ms.y  be  sent  directly  to  the  associative  memory  (or  from  the  associa¬ 
tive  memory  to  disc)  without  wasting  a  transfer  to  another  section  of  core.  From 
these  examples,  it  is  clear  that  consideration  must  be  given  to  the  executive  control 
program  which  resides  in  core  *o  enhance  the  system  use  of  an  AM.  The  executive 
control  program  must  further  be  able  to  analyze  error  conditions  that  arise  from  the 
associative  memory  so  as  to  be  able  to  take  corrective  action  whenever  possible. 
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The  study  noted  in  the  previous  paragraph  is  in  itself  not  ver  large.  It 
should,  however,  be  broadened  to  investigate  how  an  executive  contro*  program 
might  take  advantage  of  an  associative  memory.  Certain  tables  required  by  the 
executive  control  program  might  reside  permanently  in  AM  (e.  g. ,  memory  maps  of 
core  and  disc  which  show  the  available  space  as  required  for  dynamic  memory 
allocation).  Speeding  up  tab'e  lookups  could  decrease  the  overhead  time  of  executive 
control  programs.  On  the  other  hand,  the  increased  cost  of  using  an  AM,  and  the 
decreased  AM  space  for  other  processing  functions  may  not  be  warranted.  Trade-off 
studies  should  indicate  the  importance  of  evaluating  the  use  of  AMs  for  executive  control 
programs,  and  more  broadly  for  the  use  of  AMs  in  a  multi-programming  environment. 

IT  IS  RECOMMENDED  THAT  EXECUTIVE  CONTROL  PROGRAMS  WHICH 
SERVICE.  -ND  JSE  THE  ASSOCIATIVE  MEMORY  AND  COORDINATE  MULTIPROCESS¬ 
ING  BE  INVESTIGATED. 

1.  3.  4.  3  The  A2/A3  Configuration.  As  noted  (Paragraph  1.  3.  2),  based  upon  this 
study,  it  would  appear  that  the  A2  configuration  as  interfaced  with  the  central  processor 
in  A3,  would  be  the  most  advantageous  for  problems  such  as  sea  surveillance.  The 
current  stud}  did  not  permit  sufficient  time  to  explore  this  combined  configuration. 

Before  such  a  configuration  is  implemented,  further  studies,  such  as  listed  below, 
should  be  made. 

(1)  Detail  the  logic  associated  with  the  A2/A3  configuration  and  develop 
a  programming  manual  which  can  be  utilized  bv  programmers. 

(2)  Develop  comparative  timing  of  the  \1  A3  configuration  for  the 
sea  surveillance  problem. 

(3)  implement  test  problems  for  the  A2/A3  ec  .iguration  other  than 
the  sea  surveillance  problem.  For  example.  RADC  has  sponsored 
work  on  other  problems  for  the  GAP  configuration.  These  problems 
should  he  coded  for  tiu  A2-  A3  configurations.  If  an  advantage 
were  indeed  evidenced,  it  would  then  be  appropriate  to  sponsor 
implementation  of  the  A2  A3  configuration  oriented  towards  an 
appropriate  central  piocessor 
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IT  IS  RECOMMENDED  THAT  THE  A2/A3  HYBRID  C  ON  FIGURATION  BE 
INVESTIGATED  TO  THE  SAME  LEVEL  OF  DETAIL  AS  THE  GAP,  A2.  AND  A3, 

1. 3. 4. 4  Alternative  Associative  Memory  Configurations.  In  developing  the  other 
(A2  and  A3)  associative  processors  considered  in  this  report,  other  processor  con¬ 
figurations  were  considered.  Several  of  these  appear  to  hrve  special  promise  and 
interest  and  should  be  considered  for  study.  These  are: 

(1)  A  multiprocessor  with  full  parallel  logic. 

(2)  A  multiprocessor  with  response  store  in  thin  film. 

(3)  An  integrated  system  with  response  store  in  thin  film. 

(4)  A  special  input-output  or  search  controller  with  full  parallel 
hardware. 

(5)  A  special  input-output  or  search  controller  with  response  store 
in  thin  film. 

The  studies  would  be  both  hardware  and  software  oriented.  The  development  of  the 
technology  to  achieve  a  response  store  in  thin  film  will  be  primarily  hardware  oriented. 
This  technology  could  permit  greater  manipulation  of  tag  bits  at  a  relatively  inexpensive 
cost.  Use  of  an  AM  as  a  special  I/O  controller  could  decrease  input-output  time,  which 
is  a  ma*or  par?  of  the  sea  surveillance  and  other  command  and  control  type  problems. 

IT  IS  RECOMMENDED  THAT  THE  ABOVE  STUDIES  BE  INSTITUTED  BE¬ 
FORE  ONE  VENTURES  OUT  TO  UTILIZE  AN  ASSOCIATIVE  MEMORY  IN  A  SIGNIFICANT 
APPLICATION.  A  great  deal  can  be  gained  from  pap  :r  studies  at  a  smaller  cost  than 
implementing  a  complex  application.  The  recomr  .  ..ded  procedure  study  is  to: 

Perform  paper  design  analyses. 

Specify  the  features  of  a  small  AM  for  general  utility. 

Develop  the  AM. 

Develop  programming  tools. 

Implement  simple,  well-defined  problems  and  make  comparative 
studies. 

Utilize  the  AM-hybrid  system  for  applications. 


(1) 

(2) 

(3) 

(4) 

(5) 

(6) 
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U.  A  5  Compiler  Study.  To  take  advantage  of  an  associative  memory,  the  program¬ 
mer  should  have  available  a  language  in  which  he  can  write  his  programs  conveniently. 
No  programming  language  currently  permits  programmers  to  write  statements  for  an 
AM  in  a  language  and  have  these  statements  compiled  to  generate  object  code. 

With  the  pre^chi  stale  oi  our  technology  it  is  not  feasible  to  permit  he  pro¬ 
grammer  to  write  statements  independent  of  whether  the  data  is  in  core  or  AM  (this 
applies  equally  to  A3  where  AM  functions  as  a  conventional  memory).  Whether  a  data 
element  should  be  processed  in  core  or  in  AM  can  be  decided  only  by  the  programmer. 
To  attempt  to  develop  a  program  which  does  not  require  the  programmer  to  specify  the 
data  locations  is  a  major  research  effort  which  may  have  no  solution  and  is  therefore 
not  recom  ucnded.  The  stud^  should  be  conducted  for  the  alternative  configurations 
studied  i"  f\is  .eport  and  for  the  recommended  A2/A3  configuration. 


THE  MAIN  AREAS  OF  THE  RECOMMENDED  STUDY  AND  INVESTIGATION 
ARE  THE  DESIGN  OF  THE  COMPILER/INTERPRETER.  ANALYSIS  OF  MULTIPROC¬ 
ESSING  CAPABILITIES,  AND  EVALUATION. 


study: 


The  following  specific  tasks  should  be  accomplished  in  each  part  of  this 

(1)  Design  of  Compiler/Interpreter 

(a)  Determine  modification,  if  any,  to  a  typical  language  re¬ 
quired  to  effectively  utilize  the  associative  memory. 

(b)  Determine  the  functions  to  be  performed  by  the  associa¬ 
tive  memory. 

(c)  Determine  whether  the  compiler  'interpreter  should  be  a  com¬ 
piler  or  an  interpreter,  or  both. 

(d)  Determine  necessary  language  extensions  required  to  permit 
the  language  user  to  control  associative  memory  functions 
(e.  g. ,  a  declarator  statement  which  specifies  that  a  block  of 
code  is  to  be  AM  code,  or  conventional  code  is  desirable. 

The  statement  would  have  the  form  (CODE  IS  j  STANDARD  }>' 

A  corresponding  delaration  having  the  function  of  putting 
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data  into  AM  might  have  the  form  DATA  IS  {  }  . 

Additionally,  statements  to: 

(i)  IDENTIFY 

(ii)  FETCH 

(iii)  ERASE 

(iv)  SUBSTITUTE 

(v)  INSERT  and 

(vi)  CHANGE  would  be  required  as  a  minimum. 

(e)  Determine  the  most  effective  techniques  for  utilizing  associa¬ 
tive  memories  and  features  of  an  associative  memory  that 
would  enhance  the  compiler /interpreter  process  (e.  g. , 
determine  whether  the  AM  should  be  utilized  during  the 
compilation  phase  and  how  it  would  be  utilized). 

(2)  Multiprocessing  Capabilities.  Review  existing  control  program 
for  a  typical  processor  to: 

(a)  Determine  modifications  required  to  extend  executive  con¬ 
trol  over  the  associative  memory. 

(b)  Determine  the  interdependence  between  the  compiler/inter¬ 
preter  and  the  executive  control  during  compile  and/or  run 
time. 

(c)  Determine  the  additional  features  required  by  the  control 
program  to  aid  in  the  manipulation  of  an  object  code  developed 
by  the  compiler/interpreter. 

(3)  Evaluation 

(a)  Comparative  evaluation  of  alternative  AM  interfaces  for  the 
compiler /interpreter  processes. 

(b)  Extended  language  developed  in  Baohus  Normal 
Form. 

(c)  Programming  Manual. 

(d)  System  design  of  the  most  effective  compiler /interpreter. 


V 
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(e)  Effective  techniques  for  AM  manipulation. 

(f )  Recommended  changes  to  the  executive  routine  and  sub¬ 
stantiated  reasons. 

If  warranted,  an  implementation  phase  could  be  instituted. 

1. 3. 4.  6  Investigate  Large  Associative  Memories.  The  findings  and  conclusions  re¬ 
ported  on  in  this  study  do  not  necessarily  apply  to  large  associative  memories.  The 
technology  associated  with  large  AMs  will  certainly  be  substantially  different  from  that 
of  small  AMs  that  exist  today.  Several  problems  must  be  studied  with  respect  to  large 
AMs.  These  include: 

(1)  How  will  the  organization  of  computers  be  affected  by  AMs?  As 
noted  in  GAP,  data-dependent  processing  is  restricted  by  transfers 
to  and  from  GAP  and  conventional  core. 

(2)  What  should  the  instruction  repertoire  of  such  systems  be  ?  By 

having  programmers  involved  in  the  early  stages  of  such  investigations, 
some  of  the  errors  made  with  GAP  (assuming  that  sea  surveillance 
is  the  problem  to  be  implemented)  can  be  avoided. 

(3)  What  applications  will  a  large  associative  memory  assist?  There 
have  been  several  papers  which  conjecture  that  a  large  AM  will 

be  effective  for  such  problems  as  data  storage  and  retrieval,  natural 
language  manipulating  and  fact  retrieval.  It  is  not  entirely  clear 
that  a  large  AM  will  make  significant  impact  on  such  problems. 
Currently,  work  in  natural  language  manipulation  and  fact  retrieval 
requires  more  efforts  in  linguistic  analysis  in  which  associative 
memories  certainly  annot  help. 
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SECTION  II.  DISCUSSION  AND  COMPARATIVE  ANALYSES 
OF  ASSOCIATIVE  MEMORIES 


L  ASSOCIATIVE  MEMORIES 

Every  item  stored  in  the  memory  of  a  computer  system  may  be  considered  as 
a  pair  of  numbers:  a  data  word  and  an  address.  The  data  word  may  be  interpreted  in 
many  different  ways:  as  a  binary  number,  a  decimal  number,  an  alphanumeric  character 
or  ^ther  symbol,  a  command,  any  part  of  any  of  these  things,  or  a  combination  of  several 
of  them,  either  homogeneous  or  heterogeneous.  The  address  refers  to  the  physical  device 
in  which  the  data  word  is  stored.  It  may  be  accessed  directly  or  indirectly,  relatively  or 
absolutely ,  with  different  groupings  of  storage  elements  possible,  as  in  variable  word  length. 

These  two  numbers  are  usually  handled  very  differently.  In  the  vast  majority 
of  applications,  data  words  are  manipulated,  arithmetically  or  logically,  in  the  program, 
and  the  results  located  in  memory  by  their  addresses.  The  data  words  may  themselves 
include  addresses,  and  great  flexibility  is  afforded  by  the  manipulation  of  addresses  in  the 
program.  But  even  in  sophisticated  routines  involving  indexing,  the  ultimate  result  is 
usually  the  retrieval  of  data  words  from  specified  memory  locations  at  the  right  times  in 
the  program.  And  each  data  word  is  usua’iy  read  from  memory  with  no  regard  whatsoever 
for  its  content. 

The  dissolution  of  the  barrier  between  data  and  addresses  can  add  greater 
flexibility  to  a  system’s  capability.  When  memory  is  accessed  according  to  its  contents, 
certain  kinds  of  decisions  are  greatly  facilitated.  Addressing  by  content  may  be  employed 
in  either  writing  or  reading.  Words  may  be  placed  in  memory  according  to  the  nature 
of  the  data  they  include,  so  that  the  physical  address  implies  something  about  the  content. 

Or  words  situated  arbitrarily  may  be  retrieved  according  to  their  content.  (The  human 
memory  uses  both  of  these  techniques  extensively  and  sometimes  inexplicably,  e. g. ,  "I’ll 
remember  the  data  George  Washington  was  born  because  it  has  the  same  digits  as  V3," 
or,  "I  don't  remember  his  name,  but  it  begins  with  a  Z  -  oh,  yes,  Fitzpatrick!") 

An  ” 'associative"  or  content-addressable  memory  can  be  either  a  software  or 
a  hardware  item.  The  circuits  in  a  memory  device  may  be  specifically  designed  to 
facilitate  addressing  bv  content.  Chi  the  other  hand,  any  of  the  operations  performed  on 
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or  by  such  a  memory  can  be  imitated  by  a  program  using  a  general-purpose  computer 
with  a  conventional  memory.  There  can  be  tremendous  disparity  in  the  time  and  space 
requirements  for  the  operation.  This  report  discusses  some  of  the  tradeoffs  involved 
in  the  choice  of  how  to  implement  associative-memory  operation. 

Loc-tion  in  memory  by  addresses  is  generally  unique;  that  is,  each  address 
corresponds  to  one  and  only  one  data  word.  Addressing  by  content  implies  the  consider¬ 
ation  of  more  than  one  data  word  at  a  time.  A  number  of  possible  candidates  is  examined, 
and  checked  as  to  their  content,  against  certain  criteria.  In  many  cases,  the  criteria  will 
not  apply  to  exactly  one  number,  but  may  be  met  by  many  or  by  none.  For  this  reason, 
the  basic  operations  in  an  associative  memory  are  not  only  transmissions  of  data,  but 
partitions  of  the  items  into  sets  according  to  the  outcomes  of  various  interrogations.  Each 
test  results  in  a  dichotomy,  dividing  the  words  into  sets  of  "responders"  and  non-responders. 
One  or  all  of  the  responders  may  be  retrieved  and  manipulated  after  a  test.  Or  the  sets 
defined  by  the  various  tests  may  be  combined  by  union  and  intersection  into  other  sets  that 
are  of  interest.  These  processes  call  for  one  or  more  registers  to  identify  sets  of  numbers, 
with  one  bit  in  a  register  for  each  word  in  the  associative  memory.  Logical  operations 
on  the  contents  of  such  registers  constitute  an  important  part  of  the  operation  of  the  AM. 

The  results  of  these  operations  in  turn  control  transfers  of  data,  by  means  of  such  opera¬ 
tions  as  "write  in  all  responders"  or  "reacj  first  non-responder. " 

Most  of  the  comparison  operations  make  use  of  one  or  more  reference  numbers 
or  comparands.  n  AM  hardware,  a  comparand  register  is  provided,  with  the  capability 
of  simultaneous  comparison  with  all  items  in  a  portion  of  the  AM.  The  operation  may  be 
bit  parallel  or  bit  serial;  the  latter  is  well-adapted  to  the  logical  requirements  of  most 
comparison  operations.  A  second  register  accompanies  the  comparator  register  for 
masking.  This  adds  the  important  facility  of  ignoring  some  bits  in  a  comparison.  It  is 
especially  useful  in  the  many  applications  where  data  words  consist  of  several  different 
parts  with  their  own  codes  and  meanings. 

The  basic  structure  of  an  AM  is  as  shown  in  Figure  2-1 .  The  memory  proper  holds 
W  words  of  B  bits  each.  The  comparand  and  mask  registers  have  B  bits  each,  and  the 
response  register(s)  W-bits.  Parity  bits  may  be  added  to  any  of  these  registers.  There 
may  also  be  one  "busy  bit"  per  word  to  indicate  whether  it  contains  significknt  data.  The 
busy  bits  constitute  a  specialized  response  register;  the  set  of  words  it  identifies  is  usually 
ANDed  together  with  a  set  of  words  satisfying  search  criteria  to  form  a  VV-bit  response 
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Comparand  Register 


word.  That  is,  the  intersection  of  the  set  of  responders  and  the  set  of  "busy"  words  is 
used  as  the  response  word.  The  register  of  busy  bits  has  inputs  to  allow  the  modifica¬ 
tion  of  these  bits  during  loading  and  unloading  of  the  memory.  Additional  busy-bit  regis¬ 
ters  could  be  added  for  further  refinements. 

The  basic  operations  on  the  contents  of  the  AM  are  identifications  of  sets  of 
numbers  that  pass  certain  tests.  A  word  is  set  up  in  a  response  register  containing  a 
ONE  for  each  word  that  is  a  member  of  the  set  and  a  ZERO  for  each  word  that  is  not. 

The  criterion  for  membership  in  the  set  is  usually  a  relationship  with  the  comparand: 
equal  to,  greater  than,  or  less  than.  There  are  also  operations  that  do  not  depend  on  a 
comparand,  such  as  maximum  or  minimum.  These  operations  often  identify  a  one-membt 
set,  but  there  may  be  more  than  one  location  in  memory  with  the  same  value. . 

The  basic  operations  are  easily  combined  into  other  tests,  such  as  unequal, 
greater  than  or  equal  to,  between  two  limits,  next  greater  than,  congruent  modulo  M,  etc. 
These  combinations  may  all  be  effected  by  logical  operations  on  response  words,  but  a 
number  of  them  may  be  built  in  as  single  operations,  depending  on  the  hardware's  facilities 

The  contents  of  response  registers  are  used  in  two  ways.  They  are  logically 
combined  with  each  other  to  form  new  response  words.  And  they  are  used  to  control  trans 
fers  into  or  out  of  the  AM.  A  set  of  numbers  specified  by  a  response  word  may  be  read, 
written,  or  deleted.  Reading  or  writing  may  be  block  transfers  between  the  AM  and  the 
computer's  active  memory,  or  may  be  carried  on  successively,  with  operations  performed 
on  each  word. 

The  logical  combinations  of  response  words  are  limited  by  the  number  of  respons 
registers  available.  But  even  if  there  is  only  one  response  register,  a  certain  amount  of 
logical  manipulation  is  possible.  The  results  of  any  test  may  be  ORed  or  ANDed  with  the 
previous  contents  of  the  response  register,  as  a  part  of  the  procedure.  Multiple  registers 
permit  the  storage  of  both  words,  as  well  as  the  result  of  the  combination. 

The  response  registers  may  or  may  not  make  their  contents  available  to  the 
computer.  They  may  also  be  capable  of  producing  a  count  of  the  number  of  ONfEs  they 
contain.  This  count  is  needed  during  the  course  of  some  operations.  In  some  cases  it  is 
not  a  full  counting  capability,  but  a  three-valued  count,  to  show  whether  the  register  contain 
no  ONEs,  exactly  one,  or  more  than  one. 
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Th-s  repoi t  discusses  soni  oi  the  many  possible  configurations  of  an  associative 
memory,  with  particular  emphasis  on  the  different  ways  they  may  be  integrated  with  the 
1604-B  computer.  Tf'ase  range  from  treatment  of  the  AM  as  a  peripheral  device  to  a 
total  integration  of  the  AM  into  the  1604 -b  memory.  One  of  the  systems  is  Goodyear's  CAP. 
The  others  were  developed  by  AUERBACii  to  show  the  effects  of  changes  in  the  design. 

2. 1. 1  Goodyear  GAP 

The  Goodyear  associative  memory,  GAP,  has  a  capacity' of  2, 048  words  of  48  bits 
each,  plus  parity  and  busy  bits  (see  Figure  2-2).  Beside  the  busy  bits,  there  are  three 
response  registers  of  1,024  bits  each.  Each  operation  involves  either  the  upper  1,024 
words  or  the  lower  1,024  words.  Results  of  the  search  (ANPed  with  the  busy  bits)  are  formed 
in  the  response  store,  which  is  called  the  S  register.  This,  in  turn,  communicates  with  the 
other  two  1,024-bit  registers,  D  and  E.  The  usual  practice  in  searching  the  whole  memory 
is  to  store  the  results  for  one  half  of  the  memory  in  each  of  these  two  registers,  but  this 
can  be  varied  at  the  programmer's  option.  Contents  of  the  S.  D,  and  E  registers  can  under¬ 
go  a  limited  number  of  logical  operations. 

The  numbers  in  the  S-  D,  and  E  registers  are  not  directly  available  to  the  1604-B. 
A  response  resolver  examines  the  contents  of  the  S  register  and  produces  either  a  count 
of  the  number  of  responders  or  a  10-bit  address  corresponding  to  the  location  of  the  first 
ONE.  These  numbers  are  used  to  control  subsequent  communications  between  the  AM  and 
the  1604-B. 

The  principal  innuts  into  the  AM  from  the  are: 

(1)  Data  words  for  the  comparand  and  mask  registers. 

(2)  Data  words  to  load  the  AM. 

(3)  Addresses  for  AM  loading. 

(4)  Search  instructions. 

The  first  three  of  these  inputs  are  transmitted  as  48-bit  words.  The  search  In¬ 
structions  use  the  1004-B's  nine-bit  external  function  code  instruction.  Composite  instruc¬ 
tions  of  more  than  nine  bits  may  be  transmitted  as  data  words,  and  stored  in  a  separate 
register  within  the  AM.  Outputs  from  the  AM  to  the  1604-B  are  data  words,  including  the 
results  obtained  by  the  response  resolver,  and  various  control  signals. 


2-5 


Comparand 


Mask 


Response 

Resolver 


Figure  2-2.  Goodyear  GAP 


The  central  feature  of  the  GAP  memory  is  the  BILOC  element  used  to  hold 
each  bit.  The  fast  response  of  these  elements  makes  it  po  ible  to  scan  the  entire  half 
memory  in  five  microseconds,  comparing  1,02  4  bits  oi„  uuuieously  with  each  of  the  48 
bits  of  the  comparand  and  forming  the  results  in  the  S  register  according  to  any  of  several 
search  algorithms.  A  test  for  equality  or  inequality  (exact  match  or  mismatch)  is  per¬ 
formed  in  one  pass.  A  more  involved  test,  such  as  "between  two  comparands,"  is  executed 
by  a  sequence  of  tests,  with  logical  manipulation  of  the  results  in  the  S,  D,  and  E  registers. 
A  search  for  the  maximum  of  all  the  numbers  in  half  the  memory  may  be  performed  in  one 
pass  or  several,  depending  on  the  technique  used.  Some  searches  take  a  variable  number 
of  passes,  depending  on  the  values  that  happen  to  appear. 

The  operations  that  take  more  than  one  pass  are  performed  within  the  AM  after 
one  command  from  the  1604-B.  The  most  involved  operation  that  can  be  performed  in  this 
way  is  the  complex  search,  in  which  ti  e  comparand  is  divided  into  as  many  as  eight  sec¬ 
tions,  and  successive  searches,  which  may  be  of  different  kinds,  are  carried  out.  The 
results  may  be  ORed  or  ANDed  together  during  the  operation.  Such  an  operation  may  carry 
out  a  composite  search  in  response  to  one  program  step;  for  example.  "Identify  all  words 
that  contain  a  5  in  index  position  X,  and  have  a  value  between  0.  489  and  0.  511  in  columns 
j  to  k,  except  those  with  a  0,  1,  or  2  in  index  po;  tior.  Y;  also  include  any  word  with  a 
negative  number  in  position  Z,  or  with  a  value  of  at  least  21  in  columns  1  tom;  and  from 
among  these  select  the  word  with  the  lowest  number  in  columns  n  to  p;  in  case  of  a  tie, 
pick  the  word  with  the  highest  value  in  columns  q  to  r.  "  There  are  restrictions  as  to  the 
arrangement  of  the  various  keys,  and  masking  cannot  be  used  in  the  usual  way  in  complex 
searches.  But  any  number  of  complex  sea’  m<ij  ';e  ccralrncd  by  successive  ac¬ 
tions  from  the  1604-B. 

The  GAP  memory  thus  operates  as  an  independent  unit,  carrying  out  its  searches 
and  then  making  results  available  to  the  1604-B.  The  prog  am  may  call  for  the  transfer 
of  the  responding  words  into  core  storage,  or  the  reading  of  their  addresses,  or  the  count  of 
how  many  responders  appeared.  Alternatively,  there  may  be  a  command  to  replace  the 
responders,  or  simply  to  erase  them  without  any  transfer  from  the  AM  to  the  1604-B. 

The  interface  between  the  AM  .and  the  computer  is  assumed  to  use  Channel  7, 
the  1604-B  transfer  channel.  Several  variations  on  this  Idea  have  been  proposed  by  Good¬ 
year.  They  discussed  a  spectrum  of  possibilities,  ranging  from  treatment  of  the  AM  as  a 
strictly  peripheral  device  to  a  tight  integration  providing  the  AM  with  direct  access  to  the 
1604-B  memory.  The  various  possibilities  in  between  the  extremes  involve  modification 
of  one,  two,  or  three  registers  in  the  1604-E.  Without  modification,  a  load,  unload,  or 
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search  operation  may  have  to  be  preceded  by  successive  transfers  to  load  the  necessary 
registers  in  the  AM.  With  modifications  to  one  or  more  of  the  A,  Q,  and  B  registers, 
this  set-up  time  is  reduced.  With  direct  memory  access,  the  execution  times  for  the 
operations  are  also  reduced.  But  in  all  of  these  configurations,  the  AM  operates  asyn¬ 
chronously  with  the  1G04-B,  so  that  each  operation' is  likely  to  iuv.lve  waiting  times  at 
the  beginning  and  end,  and  for  memory  access  in  between. 

2.1.2  Configuration  A2 

Another  combination  of  AM  hardware  and  inter!  ce  to  be  discussed  in  this  report 
is  known  as  A2  (see  Figure  2-3).  It  is  a  version  that  t;_es  less  hardware  than  GAP, 
leaving  more  operations  to  the  1604-B.  On  the  other  hand,  it  employs  more  sophisticated 
operation  codes  than  GAP  iu  uidev  Lo  facilitate  the  execution  of  the  same  operate  "s  with 
less  equipment. 

Configuration  A2  had  a  predecessor  in  the  design  stage,  known  as  Al.  This 
was  stripped  down  almost  to  the  bare  minimum  of  hardware.  It  retained  the  registers 
and  operations  that  were  indispensable  to  AM  operation,  with  additional  facilities  added 
where  they  were  seen  to  provide  a  considerable  increase  in  ability  at  a  moderate  cost.  In 
the  transition  from  Al  to  A2,  this  same  philosophy  of  expansion  was  continued.  Economical 
extensions,  providing  operational  improvements  that  compare  favorably  with  their  cost, 
were  included  in  the  design  of  A2.  Thus,  A2  represents  a  conservatively  simple  associa¬ 
tive  memory. 

In  this  configuration  there  is  only  one  response  register,  but  three  auxiliary 
features  help  in  the  formation  of  logical  combinations  of  response  sets: 

(1)  The  results  of  a  current  test  may  be  combined  with  the 
previous  contents  of  the  S  register. 

(2)  There  are  eight  sets  of  "tag  bits"  which  may  be  used  to 
mask  any  search. 

(3)  The  contents  of  the  S  register  are  available  to  the  1604-B. 

The  combinations  of  previous  contents  of  S  with  current  searches  are  not  limited 
to  AND  and  OR,  but  may  be  an^  logical  function  performed  on  each  bit.  This  includes 
equivalence,  exclusive  or,  if,  etc. 
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Figure  2-3.  Configuration  A2 
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The  tag  bits  may  be  thought  of  as  eight  1,024-bit  registers  whose  contents  may 
be  ANDed  with  the  results  of  any  search  The  loading  of  these  "registers"  takes  place 
bit-by-bit  as  word3  are  loaded  into  the  memory.. 

Unlike  the  GAP  configuration,  A2  offers  the  contents  of  the  S  register  for  reading 
into  the  1604-13.  The  1,024-bit  word  is  divided  into  32  words  of  32  bits  each  for  transler 
into  or  out  of  the  1604-B.  Transfers  into  the  S  register  may  include  the  formation  of  any 
logical  combination  of  the  previous  contents  of  S  and  the  word  supplied  by  the  1604-B.  Thus, 
the  32  words  representing  the  results  of  a  previous  search  may  be  combined  with  the  32 
words  oi  a  new  one  as  they  are  written  into  S.  Any  other  manipulations  of  response  vectors 
may  be  performed  within  the  1604-B,  and  the  results  can  be  placed  in  S  to  control  subse¬ 
quent  transfers. 

Loading  of  the  memory  in  A2,  and  reading  from  it,  are  similar  to  the  corres¬ 
ponding  operations  with  GAP. 

Reading  and  writing  operations  may  select  any  one  of  the  eight  tag  registers  to 
identify  different  sets  of  words  in  the  AM,  to  restrict  transfers  to  members  or  non-members 
r»f  tho0*  and  tn  moHifv  tb**  tag  bits  during  \  Titing. 


In  general,  the  potentiality  of  A2  lies  in  the  flexibility  of  its  operations.  The 
programmer  is  presented  with  a  broad  range  of  possibilities  in  the  performance  of  any 
search.  If  these  are  exploited  fully,  this  AM  will  extract  a  maximum  amount  of  useful 
information  from  each  of  its  operations.  Programming  guidelines  for  this  configuration 
are  given  in  Section  II,  Volume  II. 

2.1.3  ronf^urniica  A.» 

Configurition  A3  achieves  the  ultimate  in  AM- 1604-B  integration  b>  making  the 
associative  memory  a  part  of  the  1604-B  core  memory.  The  iiighest-numbered  2.048  words 
of  the  32,768  in  the  core  memory  are  simply  replaced  by  the  GAP  hardware  or  its  equiva¬ 
lent.  The  BILOC  element  has  read-write  capabilities  compatible  with  those  of  1604-B  core 
memory,  so  that  each  location  in  the  AM  can  be  addressed  normally.  It  is  accessible  to  the 
program,  and  for  communications  in  both  directions  with  whatever  peripheral  devices  are 
employed  in  a  system, 
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As  in  the  other  configurations.  the  S  register  is  1,024  bits,  and  takes  the 
results  of  searches  on  one  half  of  the  memory  at  a  time.  There  are  also  the  D  ar.d  E 
registers,  with  the  capability  of  communication  with  the  S  register  during  each  operation. 

The  effect  of  these  provisions  is  that;  the  results  of  a  current  search  may  be  ANDed  or 
ORed  with,  or  simply  replace,  the  contents  of  S,  D,  or  E,  and  the  result  stored  in  any 
of  the  three  registers,  regardless  of  which  it  depended  on.  The  previous  contents  of  the 
two  registers  not  used  in  the  comparison  "'’ll  be  preserved  in  the  process,  but  will  be 
moved  to  ujfferent  registers  if  accessary.  For  example,  the  registers  may  contain  after 
a  comparison: 

D:  (previous  contents  of  E)  AND  (result  of  current  search) 

S:  (previous  contents  of  D) 

E:  (previous  contents  of  S) 

These  changes  are  effected  by  interchanging  the  contents  of  registers  before 
and  after  the  search. 

The  response  resolver  in  A3  generates  the  count  of  the  number  of  ONEs  in 
D,  E,  or  S,  or  the  addresses  of  responders.  The  contents  of  S,  D,  and  Z  themselves  are 
not  available  to  the  computer  in  forms  other  than  responses. 

An  important  feature  of  A3  is  the  absence  of  certain  registers,  which  is  possible 
because  their  functions  are  executed  by  normal  1604-B  registers.  Some  of  these  have 
been  omitted  irom  the  preceding  diagrams  for  the  sake  of  simplicity.  Both  GAP  and  A2 
have  11-bit  address  registers  to  select  words  in  the  AM  for  writing  or  reading.  They  also 
have  18-bit  ins  -'uctiv.ii  registers,  Wuich  are  used  to  hold  instruction  words  transmitted  by 
the  1604-B  (e.  g. ,  for  the  GAP  complex  search).  Both  of  these  registers  are  obviated  in 
the  A3  configuration.  Addresses  of  words  arc  like  those  in  the  rest  of  core.  They  are 
distinguished  by  the  fact  that  their  first  four  bits  are  1111,  which  are  sensed  by  the  computer 
in  order  to  carry  out  functions  peculiar  to  the  AM,  such  as  the  manipulation  of  busy  bits 
or  reading  responders. 

Instructions  for  the  AM  are  part  of  the  regular  1604-B  program,  and  use  the 
computer's  own  instruction  register.  They  utilize  the  usually  illegal  instruction  numbers 
00  and  77  (octal).  The  remaining  18  bits  of  the  instruction  word  are  used  to  direct  the 
AM  operations.  Details  of  this  scheme  have  been  worked  out  fully  and  are  prer  a  ted  in 
Section  II,  Volume  II. 
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2.2 


COMPARISON  OF  CONFIGURATIONS  STUDIED 


2.2.1  Inti,  eduction 

This  section  amplifies  the  different  features  contained  in  each  of  the  four  asso¬ 
ciative  memories  studied  and  compares  the  four  hybrid  associative  memory  1604-B  con¬ 
figurations.  The  three  approaches  are  represented  by  the  A1  and  A2  configurations 
(which  are  basically  similar),  the  GAP  configuration,  and  the  A3  configuration.  It  is  diffi¬ 
cult,  if  not  impossible,  to  illustrate  the  differences  in  approach,  implementation,  and 
capability  by  a  simple  comparison  chart.  A  full  understanding  of  the  design,  intention, 
and  function  of  these  configurations  is  necessary  to  appreciate  their  different  capabilities. 
For  example,  the  team  anticipated  that  the  A'  configuration  would  best  facilitate  system 
processing  because  of  its  full  integration  with  the  CDC  1604-B.  This  expectation  is  borne 
out  only  if  the  A3  configuration  and  the  1604-B  provide  a  core-to-core  block  transfer  or 
move  instruction.  Because  of  the  small  associative  memory,  such  an  instruction  would  be 
invaluable  in  the  processing  involved  in  sea  surveillance.  Such  a  relatively  small  differ¬ 
ence  in  configurations,  i.e.  ,  the  presence  or  absence  of  a  move  instruction,  cannot  be 
demonstrated  adequately  by  comparison  tables,  but  can  appreciably  enhance  the  system 
processing  for  an  application.  See  Table  2-1  for  the  transfer  rale  of  the  1604- B- A3  men  e 
instruction  that  can  be  achieved  by  modifying  the  1604-B. 

Because  the  A2,  A3,  and  GAP  configurations  each  represent  a  different  organiza¬ 
tional  approach  to  the  employment  of  identical  technology,  and  because  they  have  greatly 
different  inherent  capabilities  and  their  co^*s  are  radically  different,  each  should  be  evalu¬ 
ated  separately  against  different  applications.  Only  in  the  light  of  a  given  application  can 
one  be  called  better  or  worse  than  another.  The  comparison  of  these  configurations  applies 
only  to  the  sea  surveillance  type  of  problem.  Evaluation  against  applications  would  also 
serve  to  discover  minor  shortcomings  (such  as  the  move  instruction  for  the  A3  configura¬ 
tion)  whose  removal  would  enhance  a  given  configuration  in  a  given  application. 

2.  2.  2  Physical  and  Organizational  Differences  in  the  Hybrid  Configurations  A 2, 

A3,  and  GAP 

Some  physical  and  organizational  differences  in  the  four  configurations  are  dis¬ 
played  in  Table  2-1.  This  paragraph  elaborates  upon  the  differences  and,  where  relevant, 
relates  the  effect  of  the  difference  upon  the  sea  surveilla  ce  problem. 
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TA1  v  i  ION  CHARACTERISTICS 


(1)  Number  of  general-purpose  response  vector  registers. 

This  relates  to  the  number  of  1,024-bit  registers  which 
can  receive  the  results  of  a  search  in  which  processing 
of  response  vectors  can  be  performed.  There  are  three 
such  registers  in  A3  while  the  other  configurations  have 
one  each.  The  programmer  hi  s  greater  flexibility  with 
A3,  although  for  sea  surveillance  the  lack  of  such  registers 
would  not  be  critical. 

(2)  Total  number  of  response  sector  registers.  This  relates 
to  the  total  amount  of  storage  in  units  of  1,024  bits  pro¬ 
vided  for  response  vectors.  GAP  and  A3  provide  three 
such  registers;  A2  provides  one  register  plus  l/32nd  of 

a  register  which  is  used  internally  to  accomplish  processing 
of  response  vectors  albeit  slowly  and  further  has  provision 
for  the  storage  of  eight  tag  bits;  A1  has  a  single  response 
vector  register. 

(3)  Flip-flops  per  memory  word  in  the  response  store.  A3  and 
GAP  have  two  and  one-half  flip-flops  per  word;  A2  a  little 
more  than  one-half  flip-flop  per  word;  and  Al  one-half 
flip-flop  per  word.  As  shown  in  Table  2-1,  the  response 
store  complexity  primarily  makes  up  the  difference  in  costs 
between  the  several  memories. 

(4)  Response  vector  shift.  This  indicates  whether  or  not  a  response 
vector  can  be  shifted  to  allow  the  processing  of  fields  and  adja¬ 
cent  words  in  a  logical  manner.  This  is  particularly  important 
for  problems  such  as  sea  surveillance  since  data  words  manu¬ 
ally  extend  over  more  than  one  word.  Aldoes  not  shift  (and 
hence  is  severely  limited):  A2  shifts  up  and  down,  but  does 

this  slowly;  A3  and  GAP  both  shift  down  at  high  speed.  It  was 
found  that  the  ability  to  shift  up  and  down  would  be  useful  to 
the  programmers. 

(5)  Logic  on  response  vectors.  This  indicates  which  of  the  16  possible 
functions  of  two  binary'  variables  can  be  applied  on  a  bit-by-bit 
basis  between  a  present  search  and  previous  results,  or,  in  some 
cases,  between  stored  response  vectors.  Al  provides  no  logic 
per  se  but  does  provide  for  ANDing  an  equal  search  with  previous 
results;  A2  provides  for  all  logic  functions  to  be  processed  in 

32  steps  and,hence,  provides  them  slowly,  A2  provides  a  general 
AND/OR  capability  among  the  three  response  vectors  plus  addi¬ 
tional  setting  and  resetting  capabilities;  GAP  provides  a  re¬ 
stricted  AND/OR  function  between  results  of  a  current  search 
and  previous  results.  The  capability  of  the  A3  is  the  most 
flexible.  The  cost  of  providing  it  with  the  GAP  technology  is  not 
significant  and  should  be  provided. 
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(6)  Use  of  response  store.  This  indicates  the  use  the  programmer 
may  make  of  the  results  of  a  search  or  sequence  of  search 
stores  in  a  response  vector.  In  Al,  the  response  store  only 
conditions  15-bit  addresses  for  retrieval  from  Al;  in  A2,  the 
response  store  conditions  the  retrieval  of  addresses  or  words 
stored  in  the  AM;  in  A3,  the  response  store  serves  to  define 

a  15-bit  base  execution  address  which  may  be  used  in  any  man¬ 
ner  available  to  the  1604-B  (including  the  immediate  retrieval 
of  addresses  or  data,  indexing,  or  indirect  addressing);  in  GAP, 
the  response  store  marks  words  for  retrieval  or  (because  no 
prefix  register  is  employed)  positions  numbers  in  AM  for  re¬ 
trieval. 

This  restriction  in  Al  makes  it  useless  for  the  sea  surveil¬ 
lance  problem.  It  implies  that  an  image  of  the  data  in  AM 
exists  in  core  memory.  Since  changes  in  AM  require  corres¬ 
ponding  changes  in  the  core  image,  a  great  deal  of  wasteful 
processing  results.  The  flexibility  provided  by  A3  is  xceed- 
ingly  useful  for  the  programmer.  The  ability  of  the  programmer 
to  fully  manipulate  the  response  store  is  essential  since  this 
is  the  only  interface  between  the  programmer  and  the  AM  data. 
By  its  very  Jesign,  the  power  of  the  AM  lies  in  its  capability 
to  search  on  contents  and  perform  operations  on  lists. 

(7)  Step-by-step  processing  of  responders.  This  indicates  whether 
or  not  the  1604-B  can  process  responders  individually  and  in 
order  from  the  AM ,  or  whether  it  must  retrieve  a  block  of  re¬ 
sponders  and  perform  its  own  program  loops.  Al,  A2,  and  A3 
allow  step-by-step  processing,  while  GAP  does  not.  The 
flexibility  found  in  Al,  A2,  and  A3  is  more  desirable  than  that  of 
GAP.  The  programming  sequence  for  the  former  is: 

SEARCH 
PROCESS 
GET  NEXT, 

■.  bile  the  programming  sequence  m  the  latter  is: 

SEARCH 

READ  INTO  CORE 
SET  UP  LOOP 
PROCESS 
GET  NEXT. 

As  can  be  noted,  with  GAP,  one  is  working  es  tentially  with  two 
different  memories,  GAP  and  core. 
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(8)  Number  of  hardware  searches.  This  indicates  the  number  of 
different  searches  for  which  the  hardware  and  sequencing  are 
provided  in  AM.  A1  and  A2  implement  four  different  searches; 
A3  implements  the  10  basic  searches  of  one  GAP  plus  four 
slow  variations  of  max,  min,  next  higher  than,  and  next  lower 
than;  the  GAP  implements  the  10  basic  searches  plus  the  be¬ 
tween  limits  search.  The  searches  in  A1  and  A2  are  equality, 
greater  than  or  equal,  and  equality- AND,  greater  than  or 
equal-AND.  The  latter  two  searches  cost  almost  nothing  since 
no  per-word  hardware  is  added.  The  only  additions  are  a  couple 
of  control  flip-flops  and  decoding  gates.  In  the  work  performed 
for  sea  surveillance  no  particular  advantage  was  found.  The 
addition  of  tag  bits,  as  in  the  A2  configuration,  should  be  ade- 
q.iqte  f0i‘  cot'ing  response  store  voffnrs.  The  search  instruc¬ 
tions  represent  the  power  of  the  AM  and  are  useful.  Table  2-1 
shows  the  use  of  the  search  instructions  or.  several  subroutines 
coded  for  GAP.  Of  the  searches ,  programmers  felt  that  it 
would  be  just  as  convenient  to  program  the  NHC  and  NLC  rather 
than  to  include  them  as  hardware  features. 

The  A3  search  is  more  general  than  GAP.  In  the  GAP,  the 
D  and  E  registers  are  associated  with  the  upper  and  lower 
halves  of  memory;  in  A3,  generality  is  provided  through  the 
detail  control  of  a  search  available  in  A3. 

* 

GAP  contains  one  search  feature  not  contained  in  any  of  the 
AUERBACH  memories.  This  is  the  complex  search  which 
permits  the  user  to  segment  a  word  into  fields  and  specify  for 
such  field  the  type  of  search  required.  This  is  a  powerful  in¬ 
struction  when  operating  on  tables  where  different  fields  exist. 
However,  no  advantage  was  taker  of  the  instruction  for  sea 
surveillance  since  data  fields  usually  extended  over  word 
boundaries.  The  complex  search  can  be  provided  as  a  macro 
with  more  capabilities  than  provided  in  GAP.  The  macro 
could  permit  the  programmer  to  jump  out  of  the  search  on  a 
particular  field  to  do  processing  and  then  return  to  the  search. 

(9)  Number  of  search  instructions.  This  indicates  the  number  of 
instructions  which  directly  cause  or  control  searching  of  AM. 
Each  configuration  provides  one  basic  instruction  for  each 
search  plus  modifier  lields.  The  modifier  field  control  is  most 
extensive  and  powerful  in  A3. 

(10)  Number  of  direct  data  manipulating  instructions.  This  repre- 
sents  a  somewhat  subjective  count  oi  the  number  of  instructions 
available  in  tlie  configuration  (excluding  1604-B  instructions) 
which  directly  manipulate  useful  results  us  opposed  to  loading 
or  unloading  ot  block  of  data  or  results.  A1  provides  four  such 
instructions;  A2,  13  such  instructions;  A3  provides  56  such 
instructions  plus  numerous  modifier  fields  (many  of  ^bese 
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instructions  are  the  result  of  incorporating  the  full  set  of 
operand  type  1604-B  instructions  in  associative  modes) ;  GAP 
provides  five  such  instructions  plu3  modifier  fields.  Because 
of  its  more  sound  architecture  (i.  e. ,  the  full  integration  of  the 
A3  with  1604-B  core)  the  programmer  is  provided  a  multiplicity 
of  instructions  and  manipulation  capability  at  no  cost. 

(11)  Number  of  housekeeping  and  load/unload  instructions.  This  indi¬ 
cates  the  number  of  instructions  provided  to  manipulate  the 
special  requirements  of  the  AM  or  to  load  or  unload  blocks  of 
data  to  it.  A1  provides  22  such  instructions  ;A2,  34;  A3, 24  plus 
modifier  field.  No  significance  can  be  gained  from  these  statis¬ 
tics:  as  previously  noted,  the  lack  of  a  core-to-core  move  in¬ 
struction  in  the  A3  configuration  has  tremendous  significance  with 
respect  to  the  effectiveness  of  the  A3  configuration  for  the  sea 
surveillance  problem.  This  observation  is  more  meaningful  than 
the  number  of  instructions  between  GAP  and  the  A3  configuration. 

(12)  Number  of  tag  bits.  The  tag  bits  are  provided  in  addition  to  the 
48  data  bits  of  the  1604-B  word  size,  and  conditions  may  be  speci¬ 
fied  on  them  for  searching.  A1  and  A3  generalize  the  busy  bit  of 
the  GAP  to  produce  a  true  tag  bit.  The  A2  machine  has  eight  tag 
bits  with  some  powerful  processing  provided  for  them. 

As  previously  noted  in  the  findings  and  conclusions,  tag  bits  are 
extremely  important  One  can  always  reduce  the  word  size  of 
the  Al,  A3,  or  GAP  to  permit  the  remaining  bits  to  be  tag  bits. 
However,  when  this  is  done,  an  incompatibility  exists  between 
words  in  AM  and  their  representation  in  peripheral  devices.  When 
placing  data  on  disu  one  wants  to  have  the  full  word  occupied.  If 
it  is  not,  one  has  to  read  a  portion  of  a  word  from  AM  (the  non-tag 
position)  and  this  operation  is  relatively  cumbersome.  Increased 
programming  effort  is  required  la  this  instance  which  is  neither 
desirable  or  warranted.  Providing  tag  bits  avoids  this  necessity. 

One  use  of  the  tag  bits  is  to  segment  AM.  This  is  useful  when 
one  wants  to  name  or  classify  data,  or  when  the  programmer  wants 
to  have  different  classes  of  data  in  AM  at  the  same  time.  The 
general  ability  of  the  tag  bits  may  be  emphasized  by  the  programmer 
when  he  wants  to  have  different  classes  of  data  in  AM  at  the  same 
time. 

(13)  Tag  control  of  loading/search.  This  indicates  whether  or  not  the  con¬ 
figuration  provides  for  a  selective  setting  of  the  tag  bits  when  loading 
and  for  selective  searching  of  them.  The  Al,  A2,  and  A3  configur¬ 
ations  provide  full  tag  control,  while  the  GA*>  configuration  employs 

a  busy  bit  which  must  be  present  for  a  search  to  be  satisfied-  The 
capability  to  provide  tag  control  on  a  loading/search  is  a  logical 
consequence  of  having  tag  bits. 
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(14)  Source/sink  for  load/unload.  This  indicates  the  place 
from  or  to  which  data  may  be  loaded  or  unloaded  to  the 
AM.  All  configurations,  except  A3,  load  and  unload 
only  to  1604-B  core.  The  A3  configuration  because  the 
AM  becomes  an  integral  part  of  the  1604-b  core  memory , 
may  load  or  unload  to  any  peripheral  device  or  transfer 
data  to  other  sections  of  1604-B  core. 

(15)  I/O  load  (1604-B  cyclea)/word  transferred  (in  blocks  or 
singly.  This  item  refers  to  the  load  that  is  imposed  on  the 
1604-B  for  each  word  transferred  to  or  from  associative 
memory  in  block  transfer  operations  or  in  single  word 
transfers. 

(16)  AM  instructions  per  1604-B  word.  This  item  refers  to  the 
number  of  commands  for  AM  which  can  be  contained  in  a 
48-bit  1604-B  word.  The  A1  and  A2  configurations  are  con¬ 
trolled  by  EXF  codes  or  by  INT  or  OUT  instructions.  The 
A3  configuration  uses  a  very  powerful  command  code  which 
can  pack  two  commands  per  word.  The  GAP  configuration 
uses  a  on.  command  per  word  format  Although  Al,  A2, 
and  A3  are  more  efficient  than  GAP  in  this  regard,  it  is 
not  clear  that  this  is  a  significant  factor.  It  is  akin  to  the 
perennial  debate  as  to  whether  one  address,  two  address, 
or  three  address  machines  are  more  efficient  for  the  pro¬ 
grammer. 

(17)  Comparand  and  data  shifting.  This  refers  to  the  capability 

of  a  comparand  and  its  mask  to  be  shifted  before  a  comparison, 
and  whether  or  not  data  shifted  to  or  from  AM  may  be  so 
shifted.  Al  and  A2  provide  no  shifting;  A3  and  GAP  provide 
shifting.  Although  for  the  sea  surveillance  problem  studied, 
no  use  was  made  in  this  report  of  this  feature,  it  is  believed 
essential ,  particularly  for  intelligence  data  processing. 

(18)  Mask  control  on  transfer/ search.  Thii  item  refers  to 
whethor  or  not  transfers  and  searches  may  be  masked 
according  to  the  programmer's  option  or  if  the  mask  is 
always  active.  From  a  programmer's  viewpoint  the  A3 
provides  the  most  flexibility  in  this  regard. 

(19)  Shift  control  on  transfer/search.  This  item  refers  to  whether 
and  bow  the  shifting  of  comparands  and  data  may  be  controlled 
in  transfers  and  in  searches.  The  A3  control  is  useful. 

(20)  Resolver  resettable.  This  item  refers  to  whether  or  not  the 
response^  resolver  is  resettable.  Only  A3  provides  this 
feature  and  it  also  provides  for  automatic  handling  of  resolver 
settings  when  processing  vectors  for  I/O  operations.  This  is 
potentially  useful,  since  the  programmer  often  desires  to  save 
the  output  of  responders  for  processing  at  a  later  time. 
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(21)  Configuration.  This  refers  to  the  manner  in  which  the  AMs  are 
interfaced  with  the  1604-B.  Al,  A2,  and  GAP  are  peripheral 
devices,  with  GAP  featuring  independent  programming  se¬ 
quences  via  a  special  direct  memory  access  channel.  The  A3 
is  a  fully  integrated  configuration. 

The  GAP-DMA  configuration  suffers  from  its  architecturally 
ambiguous  position  as  a  peripheral  device.  That  is,  the  over¬ 
head  in  the  1604-B  necessary  to  set  up,  control,  coordinate, 
and  synchronize  the  GAP  (operating  on  the  DMA  channel)  is 
very  high.  A3  eliminates  most  of  this  overhead  by  allowing 
direct  manipulation  of  the  associative  features  of  the  AM  by 
new  1604-B  instructions,  and  by  allowing  direct  I/O  between 
the  AM  and  tne  peripheral  devices  as  is  necessary  for  the 
AM  to  show  to  advantage  for  the  sea  surveillance  problem.  A3 
has  also  generalized  the  equipment  of  the  GAP  to  allow  more 
flexible  and  powerful  programming.  The  \M  itself  is  based 
upon  GAP  and  is  very  similar  to  it. 

(22)  Co3t  relative  to  GAP.  The  costing  of  the  AMs  is  based  upon 
the  estimates  of  what  it  would  take  to  develop  the  GAP. 
AUERBACH  cannot  estimate  precisely  the  GAP  as  no  de¬ 
tailed  breakdown  of  the  GAP  equipment  was  made  available 
to  them.  What  is, therefore,  more  meaningful  than  absolute 
costs  is  the  ratio  of  the  cost  to  that  of  GAP.  The  Al  costs 
59  percent  of  GAP,  A2  costs  62  percent  of  GAP,  and  A3 
costs  from  95  percent  to  105  percent  that  of  GAP.  Although 
A3  generalizes  GAP  and  is  architecturally  more  attractive, 
its  cost  is  not  significantly  different  from  that  of  GAP.  With 
the  A2,  one  can  afford  twice  the  storage  capacity  of  the  GAP. 
For  sea  surveillance,  GAP  would  not  have  a  significant  ad¬ 
vantage  over  A2.  It  is  not  clear  that  GAP  would  be  better  than 
a  conventional  system.  A  breakdown  of  costs  for  the  four 
configurations  is  shown  in  Table  2-20.  Section  II,  Volume  DL 
Tables  2-14  to  2-20  illustrate  the  items  used  in  costing  a 
system  with  GAP  technology.  Thus,  using  techniques  devel¬ 
oped  in  these  volumes,  one  can  estimate  other  AMs  as  required. 

2.2.3  Comments  on  the  1604-B 


This  paragraph  serves  to  document  some  ri  rations  AUERBACH  Corporation 
has  about  the  validity  of  data  processing  applications  o  the  1604-B  computer.  Our  basic 
reservation  is  very  simple:  the  1604-B,  although  a  fine  computer  of  excellent  design, 
har  not  been  designed  for  data  processing  applications  and  is  not  representative  of  the  best 
available  in  today's  market.  The  Contx*ol  Data  1604  is  basically  a  scientific  computer; 
it  is  in  fact  almost  the  last  of  the  large-scale  machines  designed  specifically  for  business 
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or  scientific  application.  It  is  necessary,  therefore,  to  enter  a  caveat.  This  report 
attempts  to  evaluate  the  very  latest  available  in  technology ,  an  associative  memory ,  in  a 
hybrid  configuration  with  a  somewhat  outdated  and  ill  suited  computer.  The  results  ob¬ 
tained  must  be  reviewed  ii  this  light  for  clearly  the  use  of  a  more  modern  data  processing 
oriented  machine  would  have  a  significant  effect  on  the  goodness  of  a  solution  for  the  sea 
surveillance  problem. 

The  following  paragraphs  discuss  some  points  which  lead  to  AUERBACH'S 
basic  reservations  on  the  1604-B. 

2. 2. 3.1  Input-Output.  The  1604-B  input-output  system  is,  by  today's  standard,  both 
inefficient  and  cumbersome,  specifically: 

(1)  The  1604-B  requires  two  memory  cycles  for  each  word 
transferred  in  or  out  via  the  I/O  system.  True,  this  is 
better  than  the  three  cycles  required  by  the  1604,  but  not 
nearly  so  good  as  the  one  cycle  required  by  the  1604-A 
or  machines  like  the  IBM  7090. 

(2)  The  input-output  organization  of  the  1604-B  is  designed  to 
handle  I/O  blocks  of  perspective  size.  Reading  and 
writing  variable  length  records  is  difficult  and  cumber¬ 
some. 

(3)  No  scatter-gather  facility'  and  therefore  no  data  skipping 
or  suppression  facility  is  provided  in  the  1604-B  I/O 
system.  Scatter-gather  I/O  systems  are  invaluable  when 
processing  large  data  bases  of  variable  length  files. 

2.  2.3.2  Interrupt.  The  1604-B  has  at  best  a  rudimentary  interrupt  system.  By  modern 
standards  it  is  totally  inadequate.  It  provides  no  separate  masking  or  priority  facilities 
for  interrupts  from  peripheral  devices;  such  facilities  are  invaluable  when  trying  to  main¬ 
tain  maximum  access  rates  to  devices  such  as  dis^  files.  The  1604  interrupt  system  re¬ 
quires  a  cumbersome  program  to  determine  what  has  caused  an  interrupt  (and  in  some 
cases  it  is  impossible  to  determine  what  has  caused  an  interrupt).  In  other  cases,  at 
least  in  the  1604,  it  is  possible  to  miss  interrupting  conditions  (i.e. ,  not  have  the  inter¬ 
rupt  hardware  recognize  that  the  condition  lias  occurred).  Although  this  has  been  corrected 
in  the  1604-A,  its  status  in  the  1604-B  is  unknown. 
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2. 2. 3. 3  Instructions.  The  instruction  set  of  the  1604-B  is  not  specifically  suited  to 
data  processing  and  data  manipulation  operations.  Specifically,  there  are  bit  handling 
instructions,  no  character  handling  instructions,  no  block  transfer  instructions,  no 
variable  length  character  string  instructions,  no  translate  instruction,  and  no  facility 
for  decimal-to-binary  conversion  and  vice  versa. 

2. 2. 3. 4  Peripheral  Equipment.  The  disc  file  employed  in  the  configurations  studied  here 
was  chosen  by  RADC  to  be  the  CDC  818  disc  file  system.  This  system  is  an  older  design 
and  cannot  compete  favorably  with  such  systems  as  the  IBM  1302  which  is  available  today. 
Since  a  large  data  manipulation  file  processing  problem  is  highly  dependent  upon  the  space 
available  and  the  speedy  access  to  data  contained  in  mass  storage,  the  use  of  the  CDC 
disc  file  surely  biases  the  results  of  this  study. 
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SECTION  III.  CRITIQUE  OF  THE  GOODYEAR  AEROSPACE  CORPORATION 

ASSOCIATIVE  PROCESSOR 


3.1  INTRODUCTION 

The  purpose  of  this  section  is  to  critique  he  Goodyear  Associative  Processor 
(GAP) from  two  viewpoints:  hardware  and  software.  This  critique  is  intended  to  be  brief 
and  is  limited  to  major  topics;  minor  comments  are  given  in  Volume  n.  It  is  difficult 
to  divorce  a  critique  of  the  GAP  from  a  critique  of  the  1604-B.  Since  the  use  of  the  1604-B 
computer  as  the  connecting  device  to  the  GAP  cannot  be  considered  as  an  ideal  combination, 
some  of  the  difficulties  noted  exist  because  of  the  1604-B.  This  fact  should  be  borne  in 
mind. 

3.2  HARDWARE  STUDIES 

This  paragraph  discusses  the  design  (configuration  and  logic)  of  the  Goodyear 
Associative  Processor.  The  discussion  is  based  on  the  GAP  as  defined  in  the  Program¬ 
ming  MAiual  and  Goodyear  Proposal  to  RADC  (Appendix  A,  Volume  II) ,  and  is  based  upon 
the  experience  of  AUERBACH  Corporation  in  applying  and  using  the  associative  technology 
developed  by  Goodyear  on  a  large  file  oriented  processing  problem.  The  comments  pre¬ 
sented  here  are  purely  qualitative  and  represent  both  different  design  philosophies  and/or 
biases  and  the  results  of  several  months  of  analysis  and  study  of  the  proper  application 
of  associative  technology. 

3. 2. 1  Direct  Memory  Access 

The  GAP  and  the  1604-B  computer  are  formed  into  an  hybrid  associative  proc¬ 
essing  system  by  using  the  direct  memory  access  (DMA)  channel  specified  by  Goodyear 
Aerospace  Corporation.  This  DMA  channel  provides  for  the  1604-B  computer  an  input-output 
channel  that  is  very  similar  to  th*>  channels  which  come  as  standard  equipment  on  machines 
of  the  IBM  7090  class.  Specifically,  the  DMA  channel  allows  the  input  and  output  of  in¬ 
formation  directly  to  1604-B  memory  under  control  of  an  external  mechanism.  In  the  7090 
type  machines,  the  external  mechanism  is  provided  by  normal  computer  programming 
(i.e. ,  using  a  starting  address  in  block-size  information)  and  then  the  mechanism  attends 
to  the  details  of  transferring  the  data  to  or  from  memory  as  defined  by  the  set  up  in  the 
control  registers.  By  providing  other  means  for  the  initialization  and  manipulation  of  the 


control  registers  in  such  external  mechanisms,  great  flexibility  can  be  achieved.  The 
Goodyear  design  for  a  DMA  channel  requires  that  the  associative  processor  specify  to 
the  1604-B  input-output  control  logic  the  address  of  each  word  to  be  transferred.  The 
address  to  be  operated  upon  is  determined  internally  in  the  associative  processor.  The 
design  allows  for  the  accessing  of  instructions  and  data  via  this  mechanism.  All  this,  iv 
effect,  provides  the  associative  processc.'  with  two  memories,  the  associative  memory 
and  the  1604-B  memoiy.  The  DMA  design  then  requires  that  instructions  for  the  asso¬ 
ciative  processor  be  resident  in  1604-B  memory  where  they  are  accessed  in  order  by  the 
associative  processor.  The  net  result  is  that  as  long  as  separate  strings  of  program 
instructions  exist,  for  the  associative  processor  and  the  1604-B,  they  may  run  indepen¬ 
dently  of  each  other.  The  1604-B  uses  its  own  memory  for  instruction  and  data  storage, 
while  the  associative  processor  uses  the  1604-B  for  the  instruction  and  data  storage  and 
its  own  memory  for  data  storage  and  search.  It  is  apparent  from  the  Goodyear  Associate  e 
Processor  design  and  the  direct  memory  access  channel,  that  the  independence  of  the 
operation  achieved  by  this  configuration  wac .  in  fact,  the  goal  desired.  In  the  opinion  of 
AUERBACH  Corporation,  however,  all  this  was  to  no  avail. 

To  understand  why  the  above  statement  is  true  we  must  review  the  reasons  for 
providing  independent  processing  of  subordinate  tasks  within  a  computer  system.  We 
refer  to  subordinate  tasks  because  it  is  clearly  intended  that  the  associative  processor  be 
subordinate  to  the  1604-B  (it  has  no  real  manipulative  abilities  of  its  own)  and  it  is  equally 
apparent  that  the  design  of  the  1604-B  computer  will  not  allow  the  efficient  use  of  more 
than  one  task  at  a  time.  Therefore,  the  activity  of  the  associative  processor  is,  in  all 
cases,  in  support  of  the  activity'  of  the  1604-B.  Subordinated  and  independently  running 
mechanisms  are  frequently  incorporated  into  computing  systems  to  accon  modate  simple 
and  time  consuming  tasks  without  the  detailed  attention  of  the  main  computing  elements. 

In  addition,  subordinate  elements  are  frequently  used  to  accomplish  major  processing 
tasks  in  parallel  when  such  parallelism  can  be  balanced  and  controlled,  and  of  course, 
when  it  is  called  for  by  the  nature  of  the  problem.  In  both  cases,  efficiency  dictates  that 
the  main  computing  element  has  other  functions  to  perform  while  the  subordinate  elements 
ars  performing  their  independent  functions.  For  normal  input-output  buffer  operations 
this  "lap  is  accomplished  by  the  job  stacking  or  ring  processing  techniques.  When  new 
inpui  nng  read  into  a  computer,  results  of  previous  processing  are  read  out  and 
*  urre*. .  processing  is  undertaken. 
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In  the  1604-B-GAP  configuration,  the  conditions  f  r  useful  independence  do  not 

«k 

hold.  The  ta^k  for  which  the  GAP  is  intended  usually  takes  only  a  few  microseconds  to 
execute.  Considerable  time  is  required  to  set  up  the  GAP  for  the  execution  of  these  oper¬ 
ations;  however,  this  is  generally  data  loading  and  unloading  time  and  it  cannot  be  usefully 
overlapped  with  other  operations  in  the  1604-B.  When  the  1604-B  program  decides  a  GAP 
operation  is  needed, it  is  generally  at  a  point  at  which  there  is  no  useful  parallel  work  to 
accomplish  while  the  data  is  loaded  into  GAP,  operated  upon,  and  the  results  loaded  back 
into  the  1604-B.  Therefore,  the  1604-B,  after  initiating  the  full  sequence  of  operations, 
waits  for  the  GAP  to  provide  results.  In  effect,  during  the  load,  search,  and  unload  times, 
the  1604-B  simply  coordinates  and  equates.  Under  such  conditions  the  direct  memory 
access  channel  requires  more  elapsed  time  than  would,  for  instance,  the  high  speed  input- 
output  channel  (channel  7)  of  the  1604-B,  and,  in  addition,  it  costs  more. 

If  there  were  adequate  coordination  facilities  (such  as  interrupts)  between  the 
1604-B  and  the  GAP,  it  is  still  uncertain  that  useful  overlaps  could  be  obtained  in  their 
operation.  Without  major  overlaps  between  the  1604-B  and  the  GAP  operation,  no  useful 
purpose  is  3erved  by  the  DMA  channel. 

On  the  other  hand,  if  the  1604-B  were  replaced  by  a  much  faster  computer,  then 
the  GAP  would  take  the  form  of  the  slow  peripheral  device.  In  particular,  in  a  large 
multi-processing  system  ( i.  e. ,  in  terms  of  speed  of  loading  and  unloading) 
the  GAP  memory  would  be  such  that  the  direct  memory  access  would  be  of  considerable 
value.  Another  way  of  looking  at  it  is  to  consider  a  multi-processing  system  where  the 
computer  would  be  working  on  problems  other  than  those  associated  with  the  GAP.  In 
this  case,  the  direct  memory  access  would  be  "nluable.  The  main  function  of  the  direct 
memory  access  is  to  allow  for  loading  and  unloading  of  the  GAP  at  the  speed  of  the  read- 
writc  operation  of  the  core  memories  in  the  GAP.  A  reasonable  recommendation  for 
improvements  in  the  GA10  would  be  to  consider  utilizing  a  faster  switch  core, thereby 
reducing  the  time  it  takes  to  load  and  unload  the  GAP.  A  very  important  consideration 
in  terms  of  racimin sedations  would  be  a  modification  of  the  organizations  such  that  the 
GAP  could  be  loaded  and  unloaded  v<?  peripheral  equipments. 

3.2,2  General  Comments 

In  light  of  the  preceding  paragraph  It  is  still  legitimate  to  ask  what  comments 
can  be  made  of  the  efficacy  of  the  configuration  using  a  DMA  channel.  It  seems  to  us  that 
the  GAP  configurcdon  suffers  from  being  assembled  with  ad  hoc  features  being  appended 


to  the  design.  There  seems  to  be  no  edifying  thought  given  to  coordinated  and  integrated 
design.  This  is  perhaps  understandable  if  one  considers  that  RADC  is  purchasing  from 
Goodyear  a  technology  rather  than  a  design  sophistication.  For  the  design  philosophy 
employed  in  the  GAP,  however,  certair  weaknesses  are  readily  apparent;  these  are: 

(1)  The  interrupt  generating  mechanism  in  the  GAP  is  not 
suffic  ‘ent  to  provide  the  proper  coordination  between 
the  1604-B  and  the  GAP.  We  would  rate  this  a  design 
fault  since  there  appears  to  be  no  logical  reason  for  this 
inaccuracy.  There  is  room  in  the  coding  of  instructions 
to  allow  for  interrupt  control. 

(2)  The  instruction  repertoire  of  the  GAP  is  not  nearly  as 
efficient  as  it  might  be  and  considerable  efficiency  could 
be  gained  in  the  overall  system  by  a  careful  analysis  of  the 
uses  and  modes  of  operation  of  the  GAP  and  the  provision 
of  instructions  to  accommodate  these  operations. 

(3)  The  busy  bit  of  the  GAP  represents  a  form  of  associative 
capability  which  has  been  provided  in  highly  restricted  form  for 
no  apparent  reason.  There  is  reason  enough  for  the  capability 
but  none  for  the  restriction.  The  busy  bit  might  as  well  be  a 
generalized  tag  bit  as  in  the  AUERBACH  A3  configuration. 

(4)  The  response  vectors  in  the  GAP  (S,  D,  and  E  registers)  are 
also  overly  restricted  for  no  apparent  reaso  .  Again  the  A3 
configuration  illustrates  how  these  facilities  might  be  general¬ 
ized. 

(5)  The  design  choices  leading  to  tne  provision  of  pre-response 
vectors,  each  employing  2,000  flip-flops  and  a  single  restricted 
tag  bit,  are  open  to  serious  question.  Other  techniques  are 
available  for  the  storage  of  response  vectors,  and  additional 
tag  bits  are  probably  of  great  value.  The  AUERBACH  A2  and 
A3  configurations  illustrate  some  of  the  choices  available. 

(6)  The  requirement  for  loading  and  unloading  the  GAP  from  an 
external  memory,  rather  than  from  either  another  peripheral 
device  and/or  from  an  external  memory,  imposes  a  severe 
programming  restriction  on  time  and  data  handling. 

These  comments  may  be  expanded  upon  in  order  to  be  constructive  criticisms 
for  hardware.  This  is  accomplished  regarding  the  instruction  repertoire  discussed  in 
Paragraph  3. 3.  Regarding  the  third  comment  on  busy  bits,  it  should  be  pointed  out  that 
the  use  of  the  busy  bit  is  to  perform  isolated  searches  on  the  memory  while  retaining  the 
original  information.  Since  the  busy  bit  is  stored  in  the  core  memory,  it  is  necessary  to 
read  and  write  in  a  word  position  in  order  to  change  the  busy  bit.  The  alternative  might 
be  to  provide  over  2,000  flip-flops  to  perform  the  function  of  the  busy  bit  As  an  alternate 


it  seems  that  the  use  of  the  core  memory  for  the  storage  of  these  bits  is  less  costly 
than  the  flip-flops  and  may  be  utilized  by  the  programmer  if  he  wishes.  A  recommenda¬ 
tion  in  this  area  would  be  for  additions  in  the  instruction  repertoire  which  wouk'  ignore 
the  busy  bit.  The  comment  on  response  vectors  being  overly  restricted  is  somewhat 
subjective  if  the  programmer  properly  utilizes  the  P,  V,  and  T  indicator  tags. 

3.3  SOFTWARE 

3.3.1  Introduction 

The  software  critique  is  based  upon  observations  made  in  programming  the 
solution  of  the  defined  sea  surveillance  problem.  It  should  be  clear  that  the  following 
comments  regarding  the  programming  of  the  GAP  are  neither  complete  nor  universal. 

In  other  words,  a  programming  defiency  noted  in  this  study  may  not  exkt  in  another  ap¬ 
plication  and  conversely. 

From  a  programming  viewpoint,  the  items  to  be  considered  concern: 

(1)  Learning  experience. 

(2,  Input-output  of  data  and  the  LDR  instruction. 

(3)  Multi-programming  of  GAP  and  1604-B. 

(4)  Data  dependent  processing. 

(5)  Fixed  field  format  processing. 

(6)  .  Instruction  repertoire. 

The  latter  part  of  this  paragraph  recomrpends  a  software  technique  (i.  e. ,  the 
Tag  memory)  devised  to  overcome  some  deficiencies  noted. 

3.3.2  Learning  Experience 

The  AUERBACH  personnel  who  were  directly  involved  in  programming  this  sea 
surveillance  system  for  the  GAP  have  an  average  of  more  than  eight  years'  experience 
(from  five  to  fourteen  years)  and  they  may  be  considered  as  senior  programmers.  Without 
exception,  it  has  been  our  experience  that  it  is  very  difficult  to  become  familiar  with  and  to 
understand  the  GAP.  At  fifis  point  in  time  (nine  months  after  this  study's  beginning),  these 
people  are  just  beginning  to  feel  that  they  can  program  GAP  effectively.  Thus,  in  comparison 


with  other  computers,  it  is  felt  that  the  learning  curve  of  the  GAP  has  the  minimum 
slope.  In  other  words,  our  experience  has  been  that  it  is  more  difficult  to  learn  how  to 
program  GAP  than  any  other  computer  that  the  group  has  encountered.  Interestingly 
enough,  this  includes  the  SOLOMON  series  of  processors.  To  clarify  this  comment  fur¬ 
ther,  it  is  necessary  to  define  wuat  is  required  to  "become  familiar  and  understand  GAP.  " 
Additionally,  it  is  necessary  to  indicate  why  the  G/P  is  difficult  to  understand. 

It  is  bellied  that  a  programmer  understands  and  is  familiar  with  a  processor 
when  he  no  longer  finds  it  necessary  to  consult  the  programming  manual  to  accomplish 
normal  processing.  Of  course  it  may  be  necessary  to  reference  the  manual  when  it  is 
desired  to  optimize  some  section  of  programming  for  time  and/or  space  considerations. 
Thus,  it  is  felt  that  a  programmer  who  understands  GAP  knows  the  complete  instruction 
repertoire,  the  effect  of  instructions  upon  registers,  and  the  interface  dependent  con¬ 
straints.  Usually,  the  programmer  learns  about  these  topics  from  the  programming  manual. 
This  naturally  leads  into  one  of  the  reasons  that  the  GAP  is  difficult  to  understand  -  the 
programming  manual  as  presently  written  (Appendix  A,  Volume  II)  is  lacking, and  is  not 
clear  in, many  details. 

The  programming  manual  requires  the  following  additions: 

(1)  A  general  explanation  of  the  1604-B-GAP  hybrid  configuration 

with  a  diagram  showing  the: 

•  Addressable  Registers 

•  Data  Flow 

•  Control  Lines 

•  Interface  showing  GAP  "autonomy,"  and  the  other  com¬ 
ponents  given  in  the  explanation. 

•  An  explanation  of  all  registers  -  both  addressable  and 
not  addressable  (provided  that  the  latter  may  affect  an 
instruction  or  its  results). 

•  Storage  of  instructions  and  data. 

*  Between  1604-B  core  and  GAP  memories. 

•  Between  1604-B  core  and  GAP  registers. 

•  Input-output  characteristics  of  the  system  detailing 
restrictions  such  as  saturation's  impact  on  GAP 
processing. 


(2)  Instructions 


*  Contents  of  all  affected  registers  before,  during, 
and  after  an  instruction  is  executed. 

*  Operation  sequence  related  to  diagram  given  in 
introduction  description. 

•  Timing  on  a  detailed  level. 

•  Sequences  required  to  start  and  continue  GAP 
operations . 

Perhaps  the  most  important  item  required  in  the  current  programming  manual 
is  a  general  explanation  of  the  system.  At  the  present  time,  some  concepts  of  the  system 
may  be  inferred  from  the  manual  after  numerous  readings,  but  they  may  not  be  valid. 

For  example,  "buffer”  as  used  in  the  manual  has  no  relationship  to  a  programmer'?  usual 
(and  almost  universal)  concept  of  "buffer. "  This  example  provides  another  reason  why 
the  GAP  is  difficult  to  understand. 

The  GAP  instruction  repertoire  —  data  processing  tools  —  is  in  the  main  a 
completely  new  set  of  operations;  that  is,  there  exists  little  similarity  between  the  GAP 
and  other  processors'  instruction  sets.  Thus,  a  programmer  experienced  in  using  instruc¬ 
tions  of  conventional  processors,  where  all  instruction  repertoires  are  somewhat  similar, 
is  in  the  GAP  sense,  an  inexperienced  programmer.  Indeed,  the  conventional  experienced 
programmer  is  somewhat  handicapped  in  at  least  the  following  two  ways; 

(1)  There  exists  some  GAP  instructions  that  are  similar  to 
"conventional”  instructions;  thus, there  exists  some  rationale 
for  using  conventional  processing  techniques. 

(2)  Any  rationale  for  using  conventional  processing  techniques 
for  GAP  processing  biases  a  solution  incorrectly. 

The  situation  is  somewhat  analagous  to  replacing  a  tradesman’s  hand  tools  with 
power  tools. 

The  difficulties  noted  in  learning  the  GAP,  from  a  programmer’s  viewpoint,  can 
be  greatly  obviated  with  a  resulting  decrease  in  required  learning  time  by  providing  a 
comprehensive  pedagogically  sound  programming  manual. 


3-7 


There  are  two  types  of  data  inputs:  associative  memory  storage  inputs  and 
inputs  to  addressable  registers.  In  addition,  certain  instructions  (erase,  decrement 
and  increment  index,  exact  match)  permit  an  actual  modification  (or  simulation)  of  data 
which  may  in  some  sense  be  considered  as  data  input.  Data  outputs  include  transferring 
associative  memory  words  contents,  addresses,  or  numerical  count  into  the  1604-B  core 
memory.  The  purpose  of  this  paragraph  is  to  point  out  some  difficulty  experienced  in 
using  them. 

Consider  the  LDR  instruction.  This  instruction  specifies: 


Bits 

Contents 

36-41 

S,  Shift  Count 

24-35 

R,  an  Associative  Memory  Address 

11-  0 

N,  a  Word  Count 

where  R  is  the  beginning  address.  The  LDR  instruction  is  a  supplementary  instruction  for 
18  other  instructions,  including  search  instructions.  *  The  function  **  of  the  LDR  is  to: 

(1)  Specify  the  number  of  places  the  comparand  is  to  be  shifted. 

In  input-output  operations,  the  comparand  is  used  as  a  one 
word  buffer  between  the  GAP  and  the  1604-B  core  memory. 

In  search  operations,  it  specifies  the  number  of  places  that 
the  comparand  is  shifted  prior  to  the  search  operation. 

(2)  The  R  and  N  fields  segment  the  associative  memory  into 

’’■‘■N-l)  contiguous  words  beginning  at  location  R.  The 
ruction  is  performed  using  the  data  in  this  segment. 

The  main  purpose  of  this  paragraph  is  to  consider  the  difficulty  in  forming  this 
instruction.  If  S,  R,  and  N  are  known  before  run  time  (i.  e.  ,  their  values  are  fixed  and 
constant) ,  then  no  difficulty  occurs.  But  suppose  that  processing  is  data  dependent  (i.  e. , 
the  number  of  words  to  be  unloaded,  for  example,  are  not  known,  nor  is  the  starting 
address),  then  the  programmer  formulates  LDR  by  the  following  algorithm  (or  slight 
variations). 


(1)  Read  address  of  desired  "first"  responder,  which 

is  R.  The  LDR  instruction  is  not  applicable;  therefore, 

*  This  paragraph  also  applies  to  the  LDR  instruction  used  for  search  instructions. 
**One  or  more  of  S,  R,  and  N  may  not  be  applicable. 
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this  address  is  written  into  1604  memory  location  H. 
in  bits  36  through  24;  all  other  bits  are  zero.  1 

(2)  Read  count  of  responders  which  is  written  into  location 
H2  in  bits  11  through  0,  all  other  bits  zero  filled. 

(3)  Halt  die  GAP  processing. 

(4)  Using  1004-B  instructions,  formulate  LDR  by  merging  and  ILj. 

(5)  Pea  tart  the  GAP  processing. 

This  sequence  of  operations  occurred  more  than  80  percent  of  the  time  since  the 
R  and  N  fields  were  often  data-dependent. 

A  solution  to  this  problem  is  perhaps  not  to  zero  fill  when  reading  out  addresses 
and  counts  of  responders.  Then  and  H2  may  be  equal  and  the  address  of  the  next 
LOR  instruction. 

It  was  often  the  case  that  the  LDR  field  values  remained  constant  (after  being 
determined)  for  long  sequences  of  instructions.  It  became  tiresome  repeating  this  instruc¬ 
tion  to  the  extent  that  a  programmer  started  to  apply  the  instruction  or  specify  some  field 
incorrectly, for  operations  not  requiring  the  instruction  or  field. 

The  use  of  the  LDR  instruction  as  applied  to  search  instructions  is  not  clear. 

It  may  be  used,  by  implication,  to  divide  the  memory  into  segments  of  different  data 
classes.  But  if  this  is  true,  what  happens  to  respective  portions  of  the  D,  E,  and  RS 
buffers  not  in  the  segmented  portion?  What  happens  if  a  buffer  advance  is  given:  does  R 
advance  by  1  also?  What  happens  if  R  and  >-T  apply  to  the  lower  memory  and  the  P  bit 
specifies  the  upper  memory?  etc.  On  the  other  hand,  these  questions  seem  to  be  so  un¬ 
reasonable  (but  valid,  if  all  fields  of  LOR  are  specified  for  a  search  instruction)  to  assume 
that  only  the  S  field  of  the  LDR  instruction  is  applicable  in  search  operations.  But  it  was 
often  required  to  partition  the  memory  into  segments  with  a  particular  set  of  instructions 
applicable  to  particular  memory  segments.  For  example,  the  associative  memory  may 
contain  data  that  is  not  relative  to  the  current  subroutine  being  performed.  Therefore, 
it  is  njrt  desired  to  perform  operations  of  the  current  subroutine  on  irrelevant  data  (i.  e. , 
data  to  be  used  by  subsequent  subroutines).  This  associative  memory  segmentation  required 
for  any  GAP  instruction  may  be  accomplished  by  hardware  or  by  software  techniques. 

One  way  to  accomplish  segmentation  by  hardware  is  to  extend  the  LDR  instruction  to  make 
it  applicable  to  all  GAP  instructions,  if  desired.  This  stud  accomplished  the  desired 
segmentation  by  software,  via  the  subroutine  SPO  (Volume  II,  Appendix  C). 
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3.3.4  Multi-Programming  of  GAP-1604-B 

From  the  GAP  viewpoint,  it  is  an  autonomous  unit  performing  instructions  after 
it  starts  operating  until  a  Hal*'  instruction  occurs.  The  GAP  is  started  by  a  1604-B  pro¬ 
gram  containing  either  a  resume  or  a  force  instruction.  The  1604-B  may  also  halt  the 
GAP  by  a  clear  operation.  Thus,  the  GAP  may  be  "turned  on  or  off"  by  a  1604-B  routine; 
the  GAP  may  turn  itself  off,  but  not  on.  While  GAP  is  operating,  the  DMA  permits  the 
1604-B  to  simultaneously  execute  a  sequence  of  instructions. 

Considering  this  configuration  from  a  systems  vie',rpoint,  it  has  often  been  found 
that  a  sequence  of  1604-B  instructions  (LDR  instruction  formulation  or  arithmetic  opera¬ 
tions)  were  required  in  the  mainstream  of  GAP  processing.  It  is  then  required  to  inter¬ 
leave  these  1604-B  instructions  into  the  set  (if  any)  of  currently  operating  1604-B  instruc¬ 
tions.  For  example,  suppose  that  this  point  has  been  reached.  The  GAP  halts  and  the 
main  1804-B  instruction  set  continue.  It  can  sense  whether  or  not  GAP  is  in  operation; 
however,  several  questions  occur: 

(1)  When  does  the  mainstream  1604-B  instruction  set  sense 
GAP  activity?  Every  k  microseconds?  Every  k  instructions? 

After  the  k“  subroutine? 

(2)  If  the  GAP  is  inactive.does  it  mean  that  it  is  now  free  ("Free" 
is  not  synonymous  with  "inactive")?  Or  does  it  mean  that 

it  requires  some  1604-B  instruction  sequence  ? 

(3)  If  it  (GAP)  does  need  some  1604-B  instruction  set,  to  what 
1604-B  location  is  control  transferred  by  the  mainstream 
1604-B  program? 

(4)  After  the  1604-B  set  of  instructions  required  by  the  GAP 
are  executed,  to  what  1604-B  address  is  control  transferred 
to  continue  GAP  processing.  Similarly,  how  is  the  main¬ 
stream  1604-B  program  resumed? 

Before  attempting  to  answer  this  question,  consider  another  example.  Suppose 
that  data  being  processed  by  GAP  had  a  higher  priority  than  1604-B  processing  and  a  penalty 
(for  example,  lost  radar  data)  was  paid  for  delay  of  GAP  processing.  If  the  data  being  ■ 
processed  by  GAP  is  required  from  a  peripheral  device,  it  must  be  read  by  a  1604-B  in¬ 
struction  into  the  1604-B  before  it  can  be  transferred  into  GAP  by  a  GAP  instruction.  In 
this  example,  a  programmer  should  execute  his  1604-B  input  instruction  k  microseconds 
before  he  needs  the  data,  where  k  is  equal  to  the  sum  of  the  time  to  read  the  peripheial 


S 
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device  data  into  1604-B  core  memory  (in  this  time  period  the  GAP  may  resume)  and  the 
time  to  transfer  the  data  from  core  memory  to  GAP  (which  the  GAP  executes  with  a  proper 
LDR). 

x 

Questions  (3)  and  (4)  may  be  answered  by  use  of  the  Store  Program  Counter  and 
Jump  instruction  to  a  Halt  instruction.  If  the  GAP  needs  some  intermediate  1604-B  in¬ 
structions,  the  stored  program  counter  indicates  where  these  instructions  are  performed 
and  a  resume  instruction  picks  up  the  program  counter  set  by  the  Halt  instruction.  If 
GAP  Is  finished  processing,  it  does  not  perform  the  Store  Program  Counter  and  Jump 
Instruction;  therefore,  the  counter,  which  is  not  set,  may  be  used  as  a  flag.  This  device 
then  answers  question  (2)  on  whether  the  GAP  is  free  or  waiting  for  some  intermediate  set 
of  1604-B  instructions. 

The  preceding  technique,  however,  does  not  answer  question  (1).  The  current 
design  of  the  system  dictates  that  either' 

(1)  All  1604-B  mainstream  routines  be  shorter  than  GAP 
processing  if  the  GAP  cannot  be  delayed,  or 

(2)  The  1604-B  mainstream  routines  periodically  sense  the 
GAP  to  see  if  it  is  active.  If  not  active,  the  test  of  the 
stored  program  counter  determines  the  status  of  the  GAP 
(i.  e. ,  waiting  for  intermediate  processing  or  available 
for  further  use). 

The  first  alternative  may  be  difficult  to  control.  If  it  (time  for  1604-B  processing)  cannot 
be  controlled,  data  may  be  lost  in  the  latter  example.  A  suggested  solution  to  answer 
question  (1)  is  to  provide  an  Interrupt  capability  for  GAP  inactive. 

3.3.5  Data  Dependent  Processing 

On  several  occasions,  programming  difficulty  in  data  dependent  GAP  processing 
was  commented  upon  in  the  preceding  paragraphs;  for  example,  the  awkwardness  of  setting 
the  LDR  fields  given  in  the  algorithm  of  Paragraph  3. 3. 3.  The  purpose  of  this  paragraph  is 
to  extend  these  comments  to  other  Instructions. 

Data  dependent  processing  is  where  the  transfer  of  control  to  one  segment  of 
processing  instead  of  another  segment  is  determined  by  the  status  of  the  data.  Thus,  data 
dependent  processing  requires  a  test  for  a  particular  data  condition  and  a  corresponding 
control  (or  tranter)  operation.  The  test  may  immediately  precede  the  control  operation. 


In  this  case,  the  two  operations  (test  and  control)  are  termed  a  branch  operation.  In 
addition,  the  control  (transfer)  operation  which  is  set  by  the  data  status  test  may  follow 
some  intermediate  (between  the  test  and  control  operation)  processing  which  usually  alters 
the  original  data  (see  Paragraph  7.8).  In  this  case,  the  control  operation  is  usually  termed 
a  variable  connector  set  by  the  test  operation. 

Perhaps  the  main  feature  of  GAP  is  its  ability  to  test  the  status  of  many  data 
words  at  one  time.  Such  a  status  test  may  be  either  a  single  test  or  may  be  a  series  (or 
logical  combination)  of  related  but  individual  tests.  The  results  of  such  a  test  (and  tests) 
are  stored  in  the  D  and  E  buffers.  Based  on  these  buffers,  certain  operations  may  be  per¬ 
formed.  One  philosophy  of  the  GAP  is  that  whenever  a  new  test  (note  definition  of  "test" 
just  given)  is  required,  the  previous  test  results  are  no  longer  needed  and  are  consequently 
discarded. 

Perhaps  the  weakest  point  of  GAP  is  its  inability  to  manipulate  the  response 
stores  of  different  tests  —  they  cannot  be  stored  directly.  This  is  true  in  regard  to  data 
dependent  operations.  The  philosophy  of  destroying  previous  status  tests  results  in  the 
situation  that  the  GAP  cannot  process  "nested"  status  tests  without  destroying  results  of 
preceding  status  tests.  This  nested  status  test  capability  does  exist  on  less  than  a  word 
basis  by  use  of  a  complex  search  instruction.  Even  in  this  case,  the  intermediate  results 
of  a  complex  search  are  lost  but  may  be  easily  formed  with  a  new  test.  In  both  the  word 
search  schemes  and  the  complex  word  search  instruction,  a  conditional  test  must  always 
start  at  the  beginning  of  the  sequence  to  tie  tested. 

In  using  GAP,  analysis  for  status  tests  or  conditional  statements  must  be  per¬ 
formed  to  reduce  nested  "If"  statements  to  serial  "If"  statements  whenever  possible.  If 
this  cannot  be  accomplished,  since  intermediate  results  cannot  be  directly  stored,  the 
entire  sequence  of  neskd  testa  must  be  performed  repeatedly.  (Every  iteration  erases 
"first"  responders.)  This  also  yields  another  way  out  of  this  problem.  The  phrase 
"since  intermediate  results  cannot  be  directly  stored"  suggests  searching  for  a  method 
to  store  intermediate  results.  This  is  given  subsequently  for  GAP.  (It  is  built-in  hard¬ 
ware  in  the  A2. ) 


To  this  point,  consideration  has  been  given  to  status  tests  of  data.  Consider 
at  this  time  the  control  functions.  The  only  way  in  GAP  to  alter  a  processing  path  based 
on  a  conditional  relationship  is: 

(1)  Jump  on  no  responders. 

(2)  Jump  on  index  high. 

(3)  Jump  on  index  low. 

The  preceding  paragraph  parenthetically  mentioned  erasing  the  first  responder 
in  each  iteration  of  the  nested  conditionals.  Then  use  of  the  first  conditional  transfer 
instruction  is  clear.  By  definition,  it  is  clear  how  the  second  set  is  used.  But  they  may 
also  be  used  in  conditional  data  processing.  For  example,  Jump  to  location  C  if  one  re¬ 
sponder  is  greater  than  or  equal  to  another  responder.  The  instruction  may  be  used  if  the 
responder's  contents  are  less  than  15  bits  long,  or  if  the  data  can  be  functionally  related 
to  its  GAP  address.  But  to  use  this  instruction  requires  a  transfer  of  address  to  the 
1604-B,  a  1604-B  shift  instruction  on  the  address,  and  load  the  address  into  the  index. 

Similarly,  the  other  test  argument  must  be  entered  into  the  instruction. 

f 

The  main  point  being  made  in  the  preceding  comments  is  that  an  index  register 
is  often  required  to  be  set  to  either  a  datum  address  or  to  an  actual  datum  value.  In  the 
case  of  addresses,  both  read  address  instructions  (RAF  and  RDA)  write  the  address,  a 
1604-B  core  memory  word, in  bit  positions  35  through  24;  whereas  the  15  rightmost  bits 
are  read  into  the  index  register.  Thus,  a  1604-B  instruction  to  shift  the  address  is  required. 

3.3.6  Fixed  Field  Processing 

In  man}'  applications,  data  occurs  in  a  fixed  field  format:  that  is,  the  vaiue  for 
a  particular  class  of  data  always  appears  in  some  relative  field  position  of  a  larger  record. 

In  this  application*  for  example,  a  query,  or  request  for  information,  is  represented 
internally  in  a  fixed  field  form  where  each  "record"  corresponds  to  a  functional  operation 
related  to  the  query,  and  each  record  field  contains  some  parameters  tor  the  operation. 

Each  record  is  of  the  same  format.  In  this  type  of  da  hi  where  an  addressing  relationship 
or  topology  exists, it  was  often  required  to  read  out  the  contents,  or  the  address,  of  the 
word  (maximum  k  was  9)  positionally  related  to  a  responder.  In  like  manner,  write 
operations,  further  test  operations,  erase  instructions,  etc. ,  were  similarly  required. 

H»e  situation  is  similar  to  advancing  and  "retarding"  the  buffer*'}'  k>l  positions.  Another 
way  to  view  this  is  by  setting  an  index  register  equal  to  k  for  an  index  register  which  can  be 
applied  to  GAP  addresses  —  of  course,  none  exists. 
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Another  comment  related  to  fixed  field  processing  is  that  the  mechanism  of 
buffer  advance  does  not  permit  variable  length  fields  unless  it  is  known,  a  priori,  that 
no  more  than  one  responder  may  occur.  The  field  must  be  equal  to  the  number  of  48-bit 
words  required  to  express  the  maximum  word  sized  datum.  If  not,  it  is  possible,  and 
probable,  to  advance  buffers  into  adjacent  fields  and  still  meet  the  search  criteria. 

3.3.7  Instruction  Repertoire 

The  comments  nresented  in  preceding  paragraphs  are  also  applicable  to  other 
instructions. 

Consider  initially  the  search  instructions.  B>  a  large  majority,  the  data  sizes 
in  this  study  (number  of  bits)  are  equal  to  or  exceed  48  bits.  And  for  this  reason  the  com¬ 
plex  search  instruction  was  rarely  used.  In  fact,  it  was  removed  from  two  sections  in 
which  it  occurred  for  these  reasons: 

(1)  The  analysis  required  to  use  the  instruction  is  greater 
than  the  analysis  required  for  data  given  in  word  sizes. 

(2^  It  is  easier  to  "simulate"  the  instruction  by  masks  and 
shifts  thereby  building  in  more  flexibility. 

(3)  It  is  difficult  to  change  the  "instruction"  (a  series  of 
instructions)  since  it  is  interdependent. 

Because  the  greatest  majority  of  data  used  in  this  application  was  equal  to  or 
greater  than  48  l^its,  little  use  was  made  of  the  shift  features  of  the  LDR. 

Several  instructions  read  out  the  address  of  responders,  the  count  of  responders, 
and  the  address  of  the  program  counter.  In  every  instance,  the  balance  of  the  word  containing 
this  information  is  zero  filled.  This  probably  occurs  since  the  comparand  is  used  as  a 
one-word  buffer  in  this  operation.  From  a  programming  viewpoint,  these  instructions 
would  be  much  more  powerful  if  they  "inserted"  the  respective  data  into  the  word  in  1604- B 
core  and  did  not  zero  fill.  In  many  cases  (e.  g. ,  formulation  of  subsequent  LDR  instruc¬ 
tions,  transfer  of  control  operations,  etc.),  this  feature  would  obviate  the  need  for  per¬ 
forming  intermediate  1804-B  operations. 

It  was  found  that  the  use  of  the  busy  bits  could  be  combined  with  the  response 
store  registers,  provided  that  these  registers  could  be  initialized.  This  fact  led  to  an  inter¬ 
esting  and  powerful  supposition:  that  the  response  store  elements  could  be  individually 


addressed,  accessed,  and  written  into  by  use  of  the  current  busy  bit  instructions.  This 
feature  would  erase  a  responder  but  not  "remove"  the  datum  from  consideration. 

3.4  TAG  MEMORY 

The  motivation  of  the  Tag  Memory  is  the  fact  that  GAP  provides  no  direct 
methods  to  save  and  manipulate  response  store  "vectors"  of  more  than  one  test.  An 
initial  approach  in  this  respect  yielded  the  observation  that  there  existed  two  response 
stores  and  the  last  one  could  be  saved  by  using  the  E  register  of  the  upper  memory  as  the 
storage  device.  It  is  then  possible  to  manipulate  the  D  and  E  registers  by  use  of  MMC, 
EMC,  and  MAX  instructions  to  reset,  set,  copy,  and  perform  boolean  operations.  Of 
course,  this  resulted  in  the  loss  of  storage  of  the  upper  memory.  This  difficulty  gave 
genesis  to  what  is  now  termed  the  Tag  Memory. 

The  maior  details  of  the  Tag  Memory  are: 

(1)  Data  is  stored  only  in  the  lower  memory. 

(2)  Tags  are  stored  only  in  the  upper  memory. 

(3)  An  address  relationship,  modulo  1024,  exists  between 
the  memories  by  use  of  the  P  bit.  Thus ,  the  word  in 
the  lower  memory  is  associated  with  its  tag,  the  kth  word 

in  the  upper  memory.  This  is  GAP  memory  word  (1023  +  k). 

A  tag  has  two  elements ,  an  address  portion  of  10  bits  and  a  status  portion  of 
3v  bits.  The  add  css  portion  contains  an  address  of  a  datum  to  which  the  associated  datum 
belongs;  for  example,  an  operation's  parameter  in  the  q^1  word  of  an  operation  record  is 
related  to  the  operation  indicated  in  the  first  word.  Then,  the  address  portion  of  the  par- 
ameterh  tag  (1023  +  q)  will  contain  the  address  of  the  operation.  This  device  is,  program - 
miagwise,  equivalent  to  buffer  advance. 

The  status  portion  of  the  tag  word  is  used  to  reflect  the  status  of  the  element 
associated  with  it,  not  the  status  of  the  element  given  in  its  address  portion.  The  program¬ 
mer  must  decide,  a  priori ,  what  bit  (or  bits)  are  allocated  to  each  status  and  what  the 
setting  of  the  bits  means.  Suppose  for  example  that  the  tenth  status  bit  is  allocated  to 
some  value.  If  the  datum  is  greater  than  or  equal  to  this  value,  the  tenth  bit  is  set;  other¬ 
wise,  it  is  reset  This  may  be  accomplished  as  follows: 

(1)  The  lower  memory  data  is  tested. 
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(2)  T  e  D  register  is  transferred  to  the  E  register. 

(3)  A  full  word  >f  ones  is  written  into  a  responder  through 
a  mask  with  a  set  bit  in  only  that  position  corresponding 
to  the  tenth  status  hit's  position. 

(4)  Complement  ie  E  buffer,  write  a  full  word  of  zeros  through 
the  same  mask,  erase  responders ,  and  complement  the  E 
buffer,  if  required. 

It  is  therefore  feasible  to  save  at  least  38  previous  response  store  vectors  by  use  of  the 
Tag  Memory  by  appropriate  manipulation  of  the  mask  register  (and/or  the  comparand). 
Since  addresses  of  responders  in  die  lower  memory  may  be  stored  in  the  upper  memory, 
it  is  also  possible  to  query  the  range  of  addresses  for  responders  as  well. 

The  Tag  Memory  is  die  best  determined  method  of  using  the  GAP  for  data  depen¬ 
dent  processing.  It  is  recommended  that  it  be  used  whenever  such  processing  occurs. 
However,  such  use  is  aa  expense  of  losing  half  the  memory's  storage  capability. 
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